DESCRIPTION
PROTEIN KINASES 

FIELD OF THE INVENTION 
The present invention relates to novel kinase polypeptides, nucleotide sequences 
encoding the novel kinase polypeptides, as well as various products and methods useful 
for the diagnosis and treatment of various kinase-related diseases and conditions. 

BACKGROUND OF THE INVENTION 
The following description of the background of the invention is provided to aid in 
understanding the invention, but is not admitted to be or to describe prior art to the 
invention. 

Cellular signal transduction is a fundamental mechanism whereby external stimuli 
that regulate diverse cellular processes are relayed to the interior of cells. One of the key 
biochemical mechanisms of signal transduction involves the reversible phosphorylation of 
proteins, which enables regulation of the activity of mature proteins by altering their 
structure and function. 

Protein phosphorylation plays a pivotal role in biological signal transduction. 
Among the biological functions controlled by protein phosphorylation are the following: 
cell division, differentiation and death (apoptosis); cell motility and cytoskeletal structure;
control of DNA replication, transcription, splicing and translation; protein translocation
events from the endoplasmic reticulum and Golgi apparatus to the membrane and
extracellular space; protein nuclear import and export; regulation of metabolic reactions, 
etc. Abnormal protein phosphorylation is widely recognized to be causally linked to the 
etiology of many diseases including cancer as well as immunologic, neuronal and 
metabolic disorders. 

The most common phospho-acceptor amino acid residues are serine, threonine and 
tyrosine. Phosphorylation on histidine has also been observed in bacteria. The presence of
a phosphate moiety modulates protein function in multiple ways. A common mechanism
includes changes in the catalytic properties (Vmax and Km) of an enzyme leading to its
activation or inactivation. A second widely recognized mechanism involves promoting
protein-protein interactions. An example of this is the tyrosine autophosphorylation of the
ligand-activated EGF receptor tyrosine kinase. This event triggers the high-affinity
binding of the phosphotyrosine residue on the receptor's C-terminal intracellular domain
to the SH2 motif of the adaptor molecule Grb2. Grb2 in turn binds through its SH3 motif
to a second adaptor molecule, such as SHC. The formation of this ternary complex
activates the signaling events that are responsible for the biological effects of EGF. Serine
and threonine phosphorylation events have also recently been recognized to exert their
biological function through protein-protein interaction events mediated by the high-affinity
binding of phosphoserine and phosphothreonine to WW motifs present in a large
variety of proteins (Lu, P.J. et al. (1999) Science 283:1325-1328). A third important
outcome of protein phosphorylation is changes in the subcellular localization of the
substrate. As an example, nuclear import and export events in a large diversity of proteins
are regulated by protein phosphorylation (Drier E.A. et al. (1999) Genes Dev 13:556-568).

Protein kinases are one of the largest families of eukaryotic proteins with several
hundred known members. These proteins share a 250-300 amino acid domain that can be
subdivided into 12 distinct subdomains that comprise the common catalytic core structure.
These conserved protein motifs have recently been exploited using PCR-based and
bioinformatic strategies leading to a significant expansion of the known kinases. Multiple
alignment of the sequences in the catalytic domain of protein kinases and subsequent
parsimony analysis permit their segregation into a dendrogram reflecting the relatedness
of their catalytic domains (Fig. 1). In this manner, related kinases are clustered into
distinct branches or subfamilies including: tyrosine kinases, cyclic-nucleotide-dependent
kinases, calcium/calmodulin kinases, cyclin-dependent kinases and MAP-kinases, serine-threonine
kinase receptors, and several other less defined subfamilies.

We have recently completed a systematic analysis of the protein kinases present in
C. elegans, the multicellular organism whose entire DNA sequence has been determined.
We identified 473 unique kinase profiles including 398 full-length conventional kinases,
and 20 additional proteins that may function as atypical protein kinases (Plowman G.D.
et al. (1999) Proc. Natl. Acad. Sci. 96:13603-13610).

Using parsimony analysis, the protein kinases may be divided into 4 major groups:
AGC, CAMK, CMGC and tyrosine kinases. In addition, there are a number of minor yet
distinct families, including the STE and casein kinase 1, families related to worm- or
fungal-specific kinases, and a family designated "other" to represent several smaller
families. In addition, we designate an "atypical" family to represent protein kinases whose
catalytic domain has little or no primary sequence homology to conventional kinases,
including the A6 kinases and PD kinases.

The AGC kinases are basic amino acid-directed enzymes that phosphorylate 
residues found proximal to Arg and Lys. Examples of this group are the cyclic nucleotide- 
dependent kinases, G protein kinases, NDR or DBF2 and the ribosomal S6 kinases. 

The CAMK group kinases are also basic amino acid-directed kinases. They include 
the Ca2+/calmodulin-regulated and AMP-dependent protein kinases, myosin light chain
kinases, checkpoint 2 kinases (CHK2) and EMK-related protein kinases. The EMK family
of STKs is involved in the control of cell polarity, microtubule stability and cancer. One
member of the EMK family, C-TAK1, has been reported to control entry into mitosis by
activating Cdc25C which in turn dephosphorylates Cdc2. 

CMGC group kinases are "proline-directed" enzymes phosphorylating residues 
that exist in a proline-rich context. They include the cyclin-dependent kinases (CDKs), 
mitogen-activated kinases (MAPKs), GSK3s and CLKs. Most CMGC kinases have
larger-than-average kinase domains owing to the presence of insertions within subdomains
X and XI.

The tyrosine kinase group encompasses both cytoplasmic (e.g., src) and
transmembrane receptor tyrosine kinases (e.g., EGF receptor). These kinases play a pivotal
role in the signal transduction processes that mediate cell proliferation, differentiation and
apoptosis.

Group members that define smaller, yet distinct phylogenetic branches of
conventional kinases include the elongation factor 2 kinases (EIFKs); homologues of the
yeast sterile family kinases (STE), which refers to 3 classes of kinases that lie
sequentially upstream of the MAPKs; mixed lineage kinases (MLKs); Lim-domain
containing kinases (LIMKs); calcium-calmodulin kinase kinases (CAMKK); dual-specific
tyrosine kinases (DYRK); integrin receptor associated kinase (IRAK); testis-specific
kinases (TSK); UNC-51 related kinases (UNC); several families that are close
homologues to worm (C26C2.1, YQ09, ZC581.9, YFL033c, C24A1.3), Drosophila
(SLOB), or yeast (YDOD_sp, YGR262_sc) kinases; and others that are "unique" and
do not cluster into any obvious family.

SUMMARY OF THE INVENTION
Through a search of the EST database for homologies to the conserved catalytic 
kinase domain of protein kinases, hundreds of mammalian members of known and 
previously unidentified protein kinase families and groups have been identified as part of 
the present invention. Multiple alignment and parsimony analysis of the catalytic domain 
reveals that approximately half of these protein kinases cluster into 10 known groups, with 
the other half perhaps defining novel groups. Classification in this manner has proven
highly accurate not only in predicting motifs present in the remaining non-catalytic portion 
of each protein, but also in their regulation, substrates, and signaling pathways. The 
present invention includes the partial or complete sequence of new protein kinases, their 
classification, predicted or deduced protein structure, and a strategy for elucidating their 
biologic and therapeutic relevance. 

Thus, a first aspect of the invention features an isolated, enriched, or purified 
nucleic acid molecule encoding a kinase polypeptide selected from the group consisting of
SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126,
SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131,
SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136,
SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141,
SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146,
SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151,
SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156,
SEQ ID NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161,
SEQ ID NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166,
SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171,
SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176,
SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181,
SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186,
SEQ ID NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191,
SEQ ID NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196,
SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201,
SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206,
SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211,
SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216,
SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221,
SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226,
SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231,
SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236,
SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241,
and SEQ ID NO:242.

By "isolated" in reference to nucleic acid is meant a polymer of nucleotides 
conjugated to each other, including DNA and RNA, that is isolated from a natural source
or that is synthesized. The isolated nucleic acid of the present invention is unique in the
sense that it is not found in a pure or separated state in nature. Use of the term "isolated"
indicates that a naturally occurring sequence has been removed from its normal cellular
(i.e., chromosomal) environment. Thus, the sequence may be in a cell-free solution or
placed in a different cellular environment. The term does not imply that the sequence is
the only nucleotide chain present, but that it is essentially free (about 90-95% pure at
least) of non-nucleotide material naturally associated with it, and thus is distinguished
from isolated chromosomes.

By the use of the term "enriched" in reference to nucleic acid is meant that the
specific DNA or RNA sequence constitutes a significantly higher fraction (2-5 fold) of
the total DNA or RNA present in the cells or solution of interest than in normal or
diseased cells or in the cells from which the sequence was taken. This could be caused by
a person by preferential reduction in the amount of other DNA or RNA present, or by a
preferential increase in the amount of the specific DNA or RNA sequence, or by a
combination of the two. However, it should be noted that enriched does not imply that
there are no other DNA or RNA sequences present, just that the relative amount of the
sequence of interest has been significantly increased. The term "significant" is used to
indicate that the level of increase is useful to the person making such an increase, and
generally means an increase relative to other nucleic acids of about at least 2 fold, more
preferably at least 5 to 10 fold or even more. The term also does not imply that there is no
DNA or RNA from other sources. The other source DNA may, for example, comprise
DNA from a yeast or bacterial genome, or a cloning vector such as pUC19. This term
distinguishes from naturally occurring events, such as viral infection, or tumor type
growths, in which the level of one mRNA may be naturally increased relative to other
species of mRNA. That is, the term is meant to cover only those situations in which a
person has intervened to elevate the proportion of the desired nucleic acid.
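
By way of illustration only, the fold enrichment described above can be computed as a ratio of two fractions. The following Python sketch is not part of the original disclosure: the function names, the parameter names, and the choice of a 2-fold default threshold (taken from the "about at least 2 fold" language above) are assumptions made for the example.

```python
def fold_enrichment(target_in_sample: float, total_in_sample: float,
                    target_in_source: float, total_in_source: float) -> float:
    """Ratio of the target's fraction in the sample to its fraction in the source.

    All four quantities are hypothetical measurements in the same units
    (for example, nanograms of nucleic acid).
    """
    for value in (target_in_sample, total_in_sample, target_in_source, total_in_source):
        if value <= 0:
            raise ValueError("all amounts must be positive")
    sample_fraction = target_in_sample / total_in_sample
    source_fraction = target_in_source / total_in_source
    return sample_fraction / source_fraction


def is_enriched(fold: float, minimum_fold: float = 2.0) -> bool:
    """True when the fold increase meets the chosen threshold (assumed default: 2-fold)."""
    return fold >= minimum_fold
```

For example, a sequence that makes up 1% of the nucleic acid in a preparation but 0.1% in the source cells would be about 10-fold enriched under this calculation. The calculation says nothing about how the increase arose; the requirement that a person has intervened is a separate condition.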

It is also advantageous for some purposes that a nucleotide sequence be in purified
form. The term "purified" in reference to nucleic acid does not require absolute purity
(such as a homogeneous preparation). Instead, it represents an indication that the
sequence is relatively more pure than in the natural environment (compared to the natural
level this level should be at least 2-5 fold greater, e.g., in terms of mg/mL). Individual
clones isolated from a cDNA library may be purified to electrophoretic homogeneity. The
claimed DNA molecules obtained from these clones could be obtained directly from total
DNA or from total RNA. The cDNA clones are not naturally occurring, but rather are
preferably obtained via manipulation of a partially purified naturally occurring substance
(messenger RNA). The construction of a cDNA library from mRNA involves the creation
of a synthetic substance (cDNA) and pure individual cDNA clones can be isolated from
the synthetic library by clonal selection of the cells carrying the cDNA library. Thus, the
process which includes the construction of a cDNA library from mRNA and isolation of
distinct cDNA clones yields an approximately 10^6-fold purification of the native message.
Thus, purification of at least one order of magnitude, preferably two or three orders, and
more preferably four or five orders of magnitude is expressly contemplated.

By a "kinase polypeptide" is meant 10 (preferably 20, more preferably 40, most
preferably 75) or more contiguous amino acids set forth in an amino acid sequence
selected from the group consisting of those set forth in
SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126,
SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131,
SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136,
SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141,
SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146,
SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151,
SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156,
SEQ ID NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161,
SEQ ID NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166,
SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171,
SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176,
SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181,
SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186,
SEQ ID NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191,
SEQ ID NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196,
SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201,
SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206,
SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211,
SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216,
SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221,
SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226,
SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231,
SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236,
SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241,
and SEQ ID NO:242, or functional
derivatives thereof as described herein. For sequences for which the full-length sequence
is not given, the remaining sequences can be determined using methods well-known to
those in the art and are intended to be included in the invention. In certain aspects,
polypeptides of 100, 200, 300 or more amino acids are preferred. The kinase polypeptide
can be encoded by a full-length nucleic acid sequence or any portion of the full-length
nucleic acid sequence, so long as a functional activity of the polypeptide is retained. By
"functional" domain is meant any region of the polypeptide that may play a regulatory or
catalytic role as predicted from amino acid sequence homology to other proteins or by the
presence of amino acid sequences that may give rise to specific structural conformations
(e.g., coiled-coils). For some purposes, polypeptide domains are preferred, including, but
not limited to, N-terminal, catalytic/kinase and C-terminal.
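
As an illustration only, the contiguous-residue criterion in the definition of a kinase polypeptide above (10 or more contiguous amino acids of a listed sequence) can be checked computationally. The Python sketch below is not part of the original disclosure; the function names and the toy sequences in the usage note are hypothetical, and the sketch addresses only the length criterion, not functional activity.

```python
def longest_shared_run(candidate: str, reference: str) -> int:
    """Length of the longest contiguous stretch of residues found in both sequences.

    Sequences are plain strings of one-letter amino acid codes. This is the
    standard longest-common-substring dynamic program, kept to two rows.
    """
    best = 0
    previous = [0] * (len(reference) + 1)
    for c in candidate:
        current = [0] * (len(reference) + 1)
        for j, r in enumerate(reference, start=1):
            if c == r:
                # Extend the run that ended at the previous residue of both sequences.
                current[j] = previous[j - 1] + 1
                best = max(best, current[j])
        previous = current
    return best


def meets_fragment_threshold(candidate: str, reference: str, minimum: int = 10) -> bool:
    """True when the candidate shares at least `minimum` contiguous residues with the reference."""
    return longest_shared_run(candidate, reference) >= minimum
```

With a made-up reference such as "MSTNPKPQRKTKRNTNRRPQDVKFPGG" and a candidate carrying twelve of its consecutive residues, the check passes at the default threshold of 10 and at the preferred threshold of 12 or fewer, and fails at 20. The thresholds 20, 40 and 75 from the definition can be passed as `minimum`.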

The amino acid sequence will be substantially similar to a sequence selected from
the group consisting of those set forth in
SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126,
SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131,
SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136,
SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141,
SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146,
SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151,
SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156,
SEQ ID NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161,
SEQ ID NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166,
SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171,
SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176,
SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181,
SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186,
SEQ ID NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191,
SEQ ID NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196,
SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201,
SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206,
SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211,
SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216,
SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221,
SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226,
SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231,
SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236,
SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241,
and SEQ ID NO:242, or the corresponding
full-length amino acid sequence, or fragments thereof. A sequence that is substantially
similar to a sequence selected from the group consisting of those set forth in
SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126,
SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131,
SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136,
SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141,
SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146,
SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151,
SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156,
SEQ ID NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161,
SEQ ID NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166,
SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171,
SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176,
SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181,
SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186,
SEQ ID NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191,
SEQ ID NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196,
SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201,
SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206,
SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211,
SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216,
SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221,
SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226,
SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231,
SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236,
SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241,
and SEQ ID NO:242 will have at least 75% identity (preferably 90%, more preferably at least 95%
and most preferably 99-100%) to a sequence selected from the group consisting of those
set forth in
SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126,
SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131,
SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136,
SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141,
SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146,
SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151,
SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156,
SEQ ID NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161,
SEQ ID NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166,
SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171,
SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176,
SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181,
SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186,
SEQ ID NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191,
SEQ ID NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196,
SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201,
SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206,
SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211,
SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216,
SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221,
SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226,
SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231,
SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236,
SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241,
and SEQ ID NO:242 or portions of or the entire corresponding full-length
amino acid sequences.

By "identity" is meant a property of sequences that measures their similarity or
relationship. Identity is measured by dividing the number of identical residues between
two sequences (either full-length or a defined domain) by the total number of residues in
the known sequence, or the domain of the known sequence, and multiplying the result
by 100. Thus, two copies of exactly the same sequence have 100% identity, but sequences
that are less highly conserved, and have replacements and substitutions, have a lower
degree of identity. "Gaps" are spaces in an alignment that can result from aligning a novel
sequence with a known sequence when the novel sequence has additions or deletions of
amino acids in comparison with the known sequence. These gaps do not factor into the
assessment of % identity using the above calculation.
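
For illustration only, the identity calculation defined above can be expressed as a short Python sketch operating on the two rows of a pairwise alignment. The sketch is not part of the original disclosure. The function name, the use of "-" as the gap character, and the treatment of a position where the novel sequence has a gap opposite a known residue (counted in the denominator as a non-identical position) are assumptions; other readings of how gaps "do not factor" into the calculation are possible.

```python
def percent_identity(known_aligned: str, novel_aligned: str, gap: str = "-") -> float:
    """Percent identity of a novel sequence relative to a known sequence.

    Both arguments are rows of the same pairwise alignment, so they must
    have equal length; gap positions are marked with `gap`.
    """
    if len(known_aligned) != len(novel_aligned):
        raise ValueError("aligned sequences must have equal length")
    # Identical residues: the same letter in both rows, and not a gap.
    identical = sum(
        1 for k, n in zip(known_aligned, novel_aligned)
        if k == n and k != gap
    )
    # Denominator: residues of the known sequence only, so gap columns
    # introduced into the known row by insertions in the novel row are skipped.
    known_residues = sum(1 for k in known_aligned if k != gap)
    if known_residues == 0:
        raise ValueError("known sequence has no residues")
    return 100.0 * identical / known_residues
```

Under these assumptions, two copies of the same sequence score 100%, and an extra residue inserted in the novel sequence does not lower the score because the corresponding gap in the known row is excluded from the count. The resulting value can then be compared against the 75%, 90%, 95% and 99-100% levels recited for substantially similar sequences.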

Those skilled in the art will recognize that several computer programs are also
available for determining sequence identity using standard parameters, for example, Blast
(Altschul, et al. (1997) Nucleic Acids Res. 25:3389-3402), Blast2 (Altschul, et al. (1990)
J. Mol. Biol. 215:403-410), and Smith-Waterman (Smith, et al. (1981) J. Mol. Biol.
147:195-197).

In preferred embodiments, the invention features isolated, enriched, or purified
nucleic acid molecules encoding a kinase polypeptide comprising a nucleotide sequence
that: (a) encodes a polypeptide having an amino acid sequence selected from the group
consisting of those set forth in
SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126,
SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131,
SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136,
SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141,
SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146,
SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151,
SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156,
SEQ ID NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161,
SEQ ID NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166,
SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171,
SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176,
SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181,
SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186,
SEQ ID NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191,
SEQ ID NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196,
SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201,
SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206,
SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211,
SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216,
SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221,
SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226,
SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231,
SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236,
SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241,
and SEQ ID NO:242, or the corresponding full-length
amino acid sequence, or fragments thereof. A sequence that is substantially similar to a
sequence selected from the group consisting of those set forth in
SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126,
SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131,
SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136,
SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141,
SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146,
SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151,
SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156,
SEQ ID NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161,
SEQ ID NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166,
SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171,
SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176,
SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181,
SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186,
SEQ ID NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191,
SEQ ID NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196,
SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201,
SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206,
SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211,
SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216,
SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221,
SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226,
SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231,
SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236,
SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241,
and SEQ ID NO:242 will
have at least 75% identity (preferably 90%, more preferably at least 95% and most
preferably 99-100%) to the sequence selected from the group consisting of those set forth in
SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126,
SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131,
SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136,
SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141,
SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146,
SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151,
SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156,
SEQ ID NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161,
SEQ ID NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166,
SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171,
SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176,
SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181,
SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186,
SEQ ID NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191,
SEQ ID NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196,
SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201,
SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206,
SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211,
SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216,
SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221,
SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226,
SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231,
SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236,
SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241,
and SEQ ID NO:242; (b) is the complement of the nucleotide sequence of (a); (c)
10 hybridizes under highly stringent conditions to the nucleotide molecule of (a) and encodes 
a naturally occurring kinase polyp^tide; (d) encodes a kinase polypeptide having an 
amino acid sequence selected from the group consisting of those set forth in SEQ ID 
NO:122, SEQ ID NO:123. SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ ID 
NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131, SEQ ID 

15 NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136, SEQ ID 
NO:137, SEQ ID NO: 138, SEQ ID NO: 139, SEQ ID NO: 140, SEQ ID N0:141, SEQ ID 
NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146, SEQ ID 
NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151, SEQ ID 
NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156, SEQ ID
NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161, SEQ ID
NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166, SEQ ID
NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171, SEQ ID
NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176, SEQ ID
NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181, SEQ ID
NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186, SEQ ID
NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191, SEQ ID
NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196, SEQ ID
NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201, SEQ ID
NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID
NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211, SEQ ID
NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID
NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID
NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID
NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231, SEQ ID
NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID
NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241, and SEQ
ID NO:242, or the corresponding full-length amino acid sequence, or fragments thereof.

A sequence that is substantially similar to a sequence selected from the group consisting of
those set forth in SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125,
SEQ ID NO:126, SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130,
SEQ ID NO:131, SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135,
SEQ ID NO:136, SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140,
SEQ ID NO:141, SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145,
SEQ ID NO:146, SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150,
SEQ ID NO:151, SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155,
SEQ ID NO:156, SEQ ID NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160,
SEQ ID NO:161, SEQ ID NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165,
SEQ ID NO:166, SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170,
SEQ ID NO:171, SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175,
SEQ ID NO:176, SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180,
SEQ ID NO:181, SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185,
SEQ ID NO:186, SEQ ID NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190,
SEQ ID NO:191, SEQ ID NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195,
SEQ ID NO:196, SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200,
SEQ ID NO:201, SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205,
SEQ ID NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210,
SEQ ID NO:211, SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215,
SEQ ID NO:216, SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220,
SEQ ID NO:221, SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225,
SEQ ID NO:226, SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230,
SEQ ID NO:231, SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235,
SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240,
SEQ ID NO:241, and SEQ ID NO:242 will have at least 75% identity (preferably 90%, 
more preferably at least 95% and most preferably 99-100%) to the sequence of SEQ ID
NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ ID
NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131, SEQ ID
NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136, SEQ ID
NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141, SEQ ID
NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146, SEQ ID
NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151, SEQ ID
NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156, SEQ ID
NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161, SEQ ID
NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166, SEQ ID
NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171, SEQ ID
NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176, SEQ ID
NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181, SEQ ID
NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186, SEQ ID
NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191, SEQ ID
NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196, SEQ ID
NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201, SEQ ID
NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID
NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211, SEQ ID
NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID
NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID
NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID 
NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231, SEQ ID 
NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID
NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241, and SEQ
ID NO:242, except that it lacks one or more, but not all, of a domain selected from the
group consisting of an N-terminal domain, a catalytic domain, a C-terminal domain, a
coiled-coil structure region, a proline-rich region, a spacer region, an insert, and a C- 
terminal tail; (e) is the complement of the nucleotide sequence of (d); (f) encodes a 
polypeptide having an amino acid sequence selected from the group consisting of those set
forth in SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID
NO:126, SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID
NO:131, SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID
NO:136, SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID
NO:141, SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID
NO:146, SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID
NO:151, SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID
NO:156, SEQ ID NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID
NO:161, SEQ ID NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID
NO:166, SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID 
NO:171, SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID
NO:176, SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID
NO:181, SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID
NO:186, SEQ ID NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID
NO:191, SEQ ID NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID
NO:196, SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID
NO:201, SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID
NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID
NO:211, SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID
NO:216, SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID
NO:221, SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID 
NO:226, SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID
NO:231, SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID
NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID
NO:241, and SEQ ID NO:242, or the corresponding full-length amino acid sequence, or
fragments thereof. (The domain demarcations of the polypeptides of the invention are 
indicated in Table 2 by reference to the kinase domain.) A sequence that is substantially
similar to a sequence selected from the group consisting of those set forth in SEQ ID
NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ ID
NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131, SEQ ID
NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136, SEQ ID
NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141, SEQ ID
NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146, SEQ ID
NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151, SEQ ID
NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156, SEQ ID
NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161, SEQ ID
NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166, SEQ ID
NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171, SEQ ID
NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176, SEQ ID
NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181, SEQ ID
NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186, SEQ ID
NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191, SEQ ID
NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196, SEQ ID
NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201, SEQ ID
NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID
NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211, SEQ ID
NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID 
NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID 
NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID
NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231, SEQ ID
NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID 
NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241, and SEQ 
ID NO:242 will have at least 75% identity (preferably 90%, more preferably at least 95% 
and most preferably 99-100%) to the sequence selected from the group consisting of those
set forth in SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ
ID NO:126, SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ
ID NO:131, SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ
ID NO:136, SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ
ID NO:141, SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ
ID NO:146, SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ
ID NO:151, SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ
ID NO:156, SEQ ID NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ
ID NO:161, SEQ ID NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ
ID NO:166, SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ
ID NO:171, SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ
ID NO:176, SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ
ID NO:181, SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ
ID NO:186, SEQ ID NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ
ID NO:191, SEQ ID NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ
ID NO:196, SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ
ID NO:201, SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ
ID NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ
ID NO:211, SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ
ID NO:216, SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ
ID NO:221, SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ
ID NO:226, SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ
ID NO:231, SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ
ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ
ID NO:241, and SEQ ID NO:242; (b) is the complement of the nucleotide sequence of (a); 
(c) hybridizes under highly stringent conditions to the nucleotide molecule of (a) and 
encodes a naturally occurring kinase polypeptide; (d) encodes a kinase polypeptide having
an amino acid sequence selected from the group consisting of those set forth in SEQ ID
NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ ID
NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131, SEQ ID
NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136, SEQ ID
NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141, SEQ ID
NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146, SEQ ID
NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151, SEQ ID
NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156, SEQ ID
NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161, SEQ ID
NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166, SEQ ID
NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171, SEQ ID
NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176, SEQ ID 
NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181, SEQ ID 
NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186, SEQ ID 
NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191, SEQ ID
NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196, SEQ ID
NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201, SEQ ID
NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID
NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211, SEQ ID
NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID
NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID
NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID
NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231, SEQ ID
NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID 
NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241, and SEQ 
ID NO:242, or the corresponding full-length amino acid sequence, or fragments thereof. 
A sequence that is substantially similar to a sequence selected from the group consisting of
those set forth in SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125,
SEQ ID NO:126, SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130,
SEQ ID NO:131, SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135,
SEQ ID NO:136, SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140,
SEQ ID NO:141, SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145,
SEQ ID NO:146, SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150,
SEQ ID NO:151, SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, 
SEQ ID NO:156, SEQ ID NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, 
SEQ ID NO:161, SEQ ID NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165,
SEQ ID NO:166, SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170,
SEQ ID NO:171, SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175,
SEQ ID NO:176, SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180,
SEQ ID NO:181, SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185,
SEQ ID NO:186, SEQ ID NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190,
SEQ ID NO:191, SEQ ID NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195,
SEQ ID NO:196, SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200,
SEQ ID NO:201, SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205,
SEQ ID NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210,
SEQ ID NO:211, SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215,
SEQ ID NO:216, SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220,
SEQ ID NO:221, SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225,
SEQ ID NO:226, SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230,
SEQ ID NO:231, SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235,
SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240,
SEQ ID NO:241, and SEQ ID NO:242 will have at least 75% identity (preferably 90%, 
more preferably at least 95% and most preferably 99-100%) to a domain of a polypeptide 
selected from the group consisting of those set forth in SEQ ID NO:122, SEQ ID NO:123,
SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ ID NO:127, SEQ ID NO:128,
SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131, SEQ ID NO:132, SEQ ID NO:133,
SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136, SEQ ID NO:137, SEQ ID NO:138,
SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141, SEQ ID NO:142, SEQ ID NO:143,
SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146, SEQ ID NO:147, SEQ ID NO:148,
SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151, SEQ ID NO:152, SEQ ID NO:153,
SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156, SEQ ID NO:157, SEQ ID NO:158,
SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161, SEQ ID NO:162, SEQ ID NO:163,
SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166, SEQ ID NO:167, SEQ ID NO:168,
SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171, SEQ ID NO:172, SEQ ID NO:173,
SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176, SEQ ID NO:177, SEQ ID NO:178,
SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181, SEQ ID NO:182, SEQ ID NO:183,
SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186, SEQ ID NO:187, SEQ ID NO:188,
SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191, SEQ ID NO:199, SEQ ID NO:193,
SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196, SEQ ID NO:197, SEQ ID NO:198,
SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201, SEQ ID NO:202, SEQ ID NO:203,
SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID NO:207, SEQ ID NO:208,
SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211, SEQ ID NO:212, SEQ ID NO:213,
SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID NO:217, SEQ ID NO:218,
SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID NO:223,
SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ID NO:228,
SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231, SEQ ID NO:232, SEQ ID NO:233,
SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238,
SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241, and SEQ ID NO:242, where the
domain is selected from the group consisting of an N-terminal domain, a catalytic domain,
a C-terminal domain, a coiled-coil structure region, a proline-rich region, a spacer region,
an insert, and a C-terminal tail; (g) is the complement of the nucleotide sequence of (f); (h)
encodes a polypeptide having an amino acid sequence selected from the group consisting
of those set forth in SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID
NO:125, SEQ ID NO:126, SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID
NO:130, SEQ ID NO:131, SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID
NO:135, SEQ ID NO:136, SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID
NO:140, SEQ ID NO:141, SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID
NO:145, SEQ ID NO:146, SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID
NO:150, SEQ ID NO:151, SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID
NO:155, SEQ ID NO:156, SEQ ID NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID
NO:160, SEQ ID NO:161, SEQ ID NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID
NO:165, SEQ ID NO:166, SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID
NO:170, SEQ ID NO:171, SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID
NO:175, SEQ ID NO:176, SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID 
NO:180, SEQ ID NO:181, SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID
NO:185, SEQ ID NO:186, SEQ ID NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID
NO:190, SEQ ID NO:191, SEQ ID NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ ID
NO:195, SEQ ID NO:196, SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID 
NO:200, SEQ ID NO:201, SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID 
NO:205, SEQ ID NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID
NO:210, SEQ ID NO:211, SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID
NO:215, SEQ ID NO:216, SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID
NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID
NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID
NO:230, SEQ ID NO:231, SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID
NO:235, SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID
NO:240, SEQ ID NO:241, and SEQ ID NO:242, or the corresponding full-length amino
acid sequence, or fragments thereof. A sequence that is substantially similar to a sequence
selected from the group consisting of those set forth in SEQ ID NO:122, SEQ ID NO:123, 
SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ ID NO:127, SEQ ID NO:128, 
SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131, SEQ ID NO:132, SEQ ID NO:133,
SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136, SEQ ID NO:137, SEQ ID NO:138,
SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141, SEQ ID NO:142, SEQ ID NO:143,
SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146, SEQ ID NO:147, SEQ ID NO:148,
SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151, SEQ ID NO:152, SEQ ID NO:153,
SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156, SEQ ID NO:157, SEQ ID NO:158,
SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161, SEQ ID NO:162, SEQ ID NO:163,
SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166, SEQ ID NO:167, SEQ ID NO:168,
SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171, SEQ ID NO:172, SEQ ID NO:173,
SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176, SEQ ID NO:177, SEQ ID NO:178,
SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181, SEQ ID NO:182, SEQ ID NO:183,
SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186, SEQ ID NO:187, SEQ ID NO:188,
SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191, SEQ ID NO:199, SEQ ID NO:193,
SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196, SEQ ID NO:197, SEQ ID NO:198,
SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201, SEQ ID NO:202, SEQ ID NO:203,
SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID NO:207, SEQ ID NO:208,
SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211, SEQ ID NO:212, SEQ ID NO:213,
SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID NO:217, SEQ ID NO:218,
SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID NO:223,
SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ID NO:228,
SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231, SEQ ID NO:232, SEQ ID NO:233, 
SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, 
SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241, and SEQ ID NO:242 will have at
least 75% identity (preferably 90%, more preferably at least 95% and most preferably 99-
100%) to the sequence selected from the group consisting of those set forth in SEQ ID
NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ ID 
NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131, SEQ ID
NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136, SEQ ID
NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141, SEQ ID
NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146, SEQ ID
NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151, SEQ ID
NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156, SEQ ID
NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161, SEQ ID
NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166, SEQ ID
NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171, SEQ ID
NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176, SEQ ID
NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181, SEQ ID
NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186, SEQ ID 
NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191, SEQ ID 
NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196, SEQ ID
NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201, SEQ ID
NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID
NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211, SEQ ID
NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID 
NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID
NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID
NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231, SEQ ID 
NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID 
NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241, and SEQ 
ID NO:242; (b) is the complement of the nucleotide sequence of (a); (c) hybridizes under
highly stringent conditions to the nucleotide molecule of (a) and encodes a naturally
occurring kinase polypeptide; (d) encodes a kinase polypeptide having an amino acid
sequence selected from the group consisting of those set forth in SEQ ID NO:122, SEQ ID
NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ ID NO:127, SEQ ID
NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131, SEQ ID NO:132, SEQ ID
NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136, SEQ ID NO:137, SEQ ID
NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141, SEQ ID NO:142, SEQ ID
NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146, SEQ ID NO:147, SEQ ID
NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151, SEQ ID NO:152, SEQ ID
NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156, SEQ ID NO:157, SEQ ID
NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161, SEQ ID NO:162, SEQ ID
NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166, SEQ ID NO:167, SEQ ID
NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171, SEQ ID NO:172, SEQ ID
NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176, SEQ ID NO:177, SEQ ID
NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181, SEQ ID NO:182, SEQ ID
NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186, SEQ ID NO:187, SEQ ID
NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191, SEQ ID NO:199, SEQ ID
NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196, SEQ ID NO:197, SEQ ID
NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201, SEQ ID NO:202, SEQ ID
NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID NO:207, SEQ ID
NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211, SEQ ID NO:212, SEQ ID
NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID NO:217, SEQ ID
NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID
NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ID 
NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231, SEQ ID NO:232, SEQ ID 
NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID NO:237, SEQ ID 
NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241, and SEQ ID NO:242, or the
corresponding full-length amino acid sequence, or fragments thereof. A sequence that is
substantially similar to a sequence selected from the group consisting of those set forth in 
SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126, 
SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131,
SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136,
SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141,
SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146,
SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151,
SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156,
SEQ ID NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161,
SEQ ID NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166,
SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171,
SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176,
SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181,
SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186,
SEQ ID NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191,
SEQ ID NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196,
SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201,
SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206,
SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211,
SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216,
SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, 
SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226,
SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231,
SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236,
SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241,
and SEQ ID NO:242 will have at least 75% identity (preferably 90%, more preferably at
least 95% and most preferably 99-100%) to the sequence of SEQ ID NO:122, SEQ ID
NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ ID NO:127, SEQ ID
NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131, SEQ ID NO:132, SEQ ID
NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136, SEQ ID NO:137, SEQ ID
NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141, SEQ ID NO:142, SEQ ID
NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146, SEQ ID NO:147, SEQ ID
NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151, SEQ ID NO:152, SEQ ID
NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156, SEQ ID NO:157, SEQ ID
NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161, SEQ ID NO:162, SEQ ID
NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166, SEQ ID NO:167, SEQ ID
NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171, SEQ ID NO:172, SEQ ID
NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176, SEQ ID NO:177, SEQ ID
NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181, SEQ ID NO:182, SEQ ID
NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186, SEQ ID NO:187, SEQ ID 
NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191, SEQ ID NO:199, SEQ ID
NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196, SEQ ID NO:197, SEQ ID
NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201, SEQ ID NO:202, SEQ ID 
NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID NO:207, SEQ ID 
NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211, SEQ ID NO:212, SEQ ID
NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID NO:217, SEQ ID
NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID
NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ID
NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231, SEQ ID NO:232, SEQ ID
NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID NO:237, SEQ ID 
NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241, and SEQ ID NO:242,
except that it lacks one or more of the domains selected from the group consisting of an N-
terminal domain, a catalytic domain, a C-terminal domain, a coiled-coil structure region, a
proline-rich region, a spacer region, an insert, and a C-terminal tail; or (i) is the
complement of the nucleotide sequence of (h). The domain demarcations of the
polypeptides of the invention are indicated in Table 2 by reference to the kinase domain. 

The term "complement" refers to two nucleotides that can form multiple favorable 
interactions with one another. For example, adenine is complementary to thymine as they
can form two hydrogen bonds. Similarly, guanine and cytosine are complementary since
they can form three hydrogen bonds. A nucleotide sequence is the complement of another 
nucleotide sequence if all of the nucleotides of the first sequence are complementary to all 
of the nucleotides of the second sequence. 
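
The base-pairing rule above can be sketched in a few lines of Python. This is an illustrative example only: the function names are hypothetical and not part of the invention, and the position-by-position comparison is a simplification that ignores strand orientation (in practice the complementary strand is read in the opposite direction).

```python
# Watson-Crick pairing for DNA, as stated in the definition above.
PAIRS = {"A": "T", "T": "A", "G": "C", "C": "G"}

# Hydrogen bonds per base pair: A-T forms two, G-C forms three.
H_BONDS = {frozenset(("A", "T")): 2, frozenset(("G", "C")): 3}


def complement(seq: str) -> str:
    """Return the base-by-base complement of a DNA sequence."""
    return "".join(PAIRS[base] for base in seq.upper())


def is_complement(first: str, second: str) -> bool:
    """True if every nucleotide of `first` pairs with the nucleotide
    at the same position of `second` (simplified, orientation ignored)."""
    return len(first) == len(second) and complement(first) == second.upper()


def hydrogen_bonds(seq: str) -> int:
    """Total hydrogen bonds formed when `seq` is paired with its complement."""
    return sum(H_BONDS[frozenset((b, PAIRS[b]))] for b in seq.upper())
```

For example, `complement("ATGC")` yields `"TACG"`, and that pairing involves two A-T pairs and two G-C pairs.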

The term "domain" refers to a region of a polypeptide that contains a particular
function. For instance, N-terminal or C-terminal domains of signal transduction proteins
can serve functions including, but not limited to, binding molecules that localize the signal
transduction molecule to different regions of the cell or binding other signaling molecules
directly responsible for propagating a particular cellular signal. Some domains can be
expressed separately from the rest of the protein and function by themselves, while others
must remain part of the intact protein to retain function. The latter are termed functional
regions of proteins and also relate to domains.

The term "N-terminal domain" refers to the extracatalytic region located between
the initiator methionine and the catalytic domain of the protein kinase. The N-terminal
domain can be identified following a Smith-Waterman alignment of the protein sequence
against the non-redundant protein database to define the N-terminal boundary of the
catalytic domain. Depending on its length, the N-terminal domain may or may not play a
regulatory role in kinase function. An example of a protein kinase whose N-terminal
domain has been shown to play a regulatory role is PAK65, which contains a CRIB motif
used for Cdc42 and rac binding (Burbelo, P.D. et al. (1995) J. Biol. Chem. 270, 29071-29074).
The N-terminal domain of a protein kinase of the invention is that portion of the
protein kinase to the amino-terminal side of the kinase domain where the kinase domain is
identified in Table 2, herein. Further, in some cases, portions of the N-terminal domains
of the protein kinases of the invention have not been identified since the entire sequence is
not available. However, with the methods described herein, the full-length sequences of
the kinases of the invention can be determined and using the approaches described herein
the N-terminal domain can be identified.

The term "catalytic domain" or "kinase domain" refers to a region of the protein
kinase that is typically 25-300 amino acids long and is responsible for carrying out the
phosphate transfer reaction from a high-energy phosphate donor molecule such as ATP or
GTP to itself (autophosphorylation) or to other proteins (exogenous phosphorylation).
The catalytic domain of protein kinases is made up of 12 subdomains that contain highly
conserved amino acid residues, and are responsible for proper polypeptide folding and for
catalysis. The catalytic domain can be identified following a Smith-Waterman alignment
of the protein sequence against the non-redundant protein database. The catalytic/kinase
domains of the protein kinases of the invention are identified in Table 2, herein. Further,
in some cases, the complete sequence of the catalytic/kinase domains of the protein
kinases of the invention may not have been provided since the entire sequence is not
available. However, with the methods described herein, the full-length sequences of the
kinases of the invention can be determined, and using the approaches described herein, the
catalytic/kinase domain can be identified.
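
The Smith-Waterman alignment referred to above can be sketched as follows. This is an illustrative local-alignment routine, not the search procedure actually used for the invention: the match, mismatch, and linear gap scores are assumptions chosen for brevity, whereas a protein database search would normally use an amino acid substitution matrix and a dedicated search tool. All names are hypothetical.

```python
def smith_waterman(a, b, match=2, mismatch=-1, gap=-2):
    """Local alignment of sequences a and b.

    Returns (best_score, (a_start, a_end), (b_start, b_end)) where the
    ranges are 0-based, half-open slices of the aligned regions.
    """
    rows, cols = len(a) + 1, len(b) + 1
    H = [[0] * cols for _ in range(rows)]  # scoring matrix, floored at zero
    best, best_pos = 0, (0, 0)

    for i in range(1, rows):
        for j in range(1, cols):
            diag = H[i - 1][j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            H[i][j] = max(0, diag, H[i - 1][j] + gap, H[i][j - 1] + gap)
            if H[i][j] > best:
                best, best_pos = H[i][j], (i, j)

    # Trace back from the best cell until a zero cell is reached.
    i, j = best_pos
    end_a, end_b = i, j
    while i > 0 and j > 0 and H[i][j] > 0:
        diag = H[i - 1][j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
        if H[i][j] == diag:
            i, j = i - 1, j - 1
        elif H[i][j] == H[i - 1][j] + gap:
            i -= 1
        else:
            j -= 1
    return best, (i, end_a), (j, end_b)
```

Aligning a query against a known kinase-domain sequence in this way gives the start and end positions of the matching region in the query, which is how the boundaries of a catalytic domain could be read off an alignment.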

The term "catalytic activity", as used herein, defines the rate at which a kinase
catalytic domain phosphorylates a substrate. Catalytic activity can be measured, for
example, by determining the amount of a substrate converted to a phosphorylated product
as a function of time. Catalytic activity can be measured by methods of the invention by
holding time constant and determining the concentration of a phosphorylated substrate
after a fixed period of time. Phosphorylation of a substrate occurs at the active-site of a
protein kinase. The active-site is normally a cavity in which the substrate binds to the
protein kinase and is phosphorylated.
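
As a small worked illustration of the fixed-time measurement described above, the rate can be expressed as the concentration of phosphorylated product divided by the elapsed time. The function names and units (micromolar, minutes) are assumptions made for this sketch.

```python
def catalytic_rate(product_conc_um: float, elapsed_min: float) -> float:
    """Average rate of phosphorylation (uM per minute) over a fixed period."""
    if elapsed_min <= 0:
        raise ValueError("elapsed time must be positive")
    return product_conc_um / elapsed_min


def relative_activity(test_conc_um: float, control_conc_um: float) -> float:
    """Ratio of product formed by a test sample to a control sample.

    Because both samples are measured after the same fixed period,
    the time term cancels and only the concentrations are compared.
    """
    if control_conc_um <= 0:
        raise ValueError("control concentration must be positive")
    return test_conc_um / control_conc_um
```

Holding time constant, as the text describes, means two samples can be compared directly through their product concentrations.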

The term "substrate" as used herein refers to a molecule phosphorylated by a
kinase of the invention. Kinases phosphorylate substrates on serine/threonine or tyrosine
amino acids. The molecule may be another protein or a polypeptide.

The term "C-terminal domain" refers to the region located between the catalytic
domain and the carboxy-terminal amino acid residue of the protein kinase. The C-terminal
domain can be identified by using a Smith-Waterman alignment of the protein
sequence against the non-redundant protein database to define the C-terminal boundary of
the catalytic domain or of any functional C-terminal extracatalytic domain. Depending on
its length and amino acid composition, the C-terminal domain may or may not play a
regulatory role in kinase function. An example of a protein kinase whose C-terminal
domain may play a regulatory role is PAK3, which contains a heterotrimeric Gβ subunit-binding
site near its C-terminus (Leeuw, T. et al. (1998) Nature, 391, 191-195). The C-terminal
domain of a protein kinase of the invention is that portion of the protein kinase to
the carboxy-terminal side of the kinase domain where the kinase domain is identified in
Table 2, herein. In some cases, the C-terminal domains of the protein kinases of the
invention have not been provided since the entire sequence is not available. However,
with the methods described herein, the full-length sequences of the kinases of the
invention can be determined, and using the approaches described herein, the C-terminal
domain can be identified.
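
Because the N-terminal and C-terminal domains are both defined relative to the kinase domain, the three regions can be derived from the kinase domain boundaries alone. The sketch below assumes 1-based, inclusive residue coordinates of the kind a table of domain demarcations might list; the function and field names are hypothetical.

```python
from typing import NamedTuple


class Domains(NamedTuple):
    n_terminal: str  # residues on the amino-terminal side of the kinase domain
    kinase: str      # catalytic/kinase domain
    c_terminal: str  # residues on the carboxy-terminal side of the kinase domain


def split_domains(protein: str, kinase_start: int, kinase_end: int) -> Domains:
    """Split a protein sequence around its kinase domain.

    kinase_start and kinase_end are 1-based, inclusive residue positions.
    """
    if not (1 <= kinase_start <= kinase_end <= len(protein)):
        raise ValueError("kinase domain coordinates fall outside the sequence")
    return Domains(
        n_terminal=protein[: kinase_start - 1],
        kinase=protein[kinase_start - 1 : kinase_end],
        c_terminal=protein[kinase_end:],
    )
```

If the available sequence is incomplete, the N-terminal or C-terminal slice is simply shorter or empty, which mirrors the situation described above where a domain cannot be reported until the full-length sequence is determined.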

The term "signal transduction pathway" refers to the molecules that propagate an
extracellular signal through the cell membrane to become an intracellular signal. This
signal can then stimulate a cellular response. The polypeptide molecules involved in
signal transduction processes are typically receptor and non-receptor protein tyrosine
kinases, receptor and non-receptor protein phosphatases, SRC homology 2 and 3 domains,
phosphotyrosine binding proteins (SRC homology 2 (SH2) and phosphotyrosine binding
(PTB and PH) domain containing proteins), proline-rich binding proteins (SH3 domain
containing proteins), nucleotide exchange factors, and transcription factors.

The term "coiled-coil structure region" as used herein, refers to a polypeptide
sequence that has a high probability of adopting a coiled-coil structure as predicted by
computer algorithms such as COILS (Lupas, A. (1996) Meth. Enzymology 266:513-525).
Coiled-coils are formed by two or three amphipathic α-helices in parallel. Coiled-coils can
bind to coiled-coil domains of other polypeptides resulting in homo- or heterodimers
(Lupas, A. (1991) Science 252:1162-1164). Coiled-coil-dependent oligomerization has
been shown to be necessary for protein function including catalytic activity of
serine/threonine kinases (Roe, J. et al. (1997) J. Biol. Chem. 272:5838-5845). Coiled-coil
regions in the proteins of the invention can be identified using these methods. They may
be present as sub-domains of the N-terminal, kinase, or C-terminal domains of the
polypeptides of the invention.

The term "proline-rich region" as used herein, refers to a region of a protein kinase
whose proline content over a given amino acid length is higher than the average content of
this amino acid found in proteins (i.e., >10%). Proline-rich regions are easily discernible
by visual inspection of amino acid sequences and quantitated by standard computer
sequence analysis programs such as the DNAStar program EditSeq. Proline-rich regions
have been demonstrated to participate in regulatory protein-protein interactions. Among
these interactions, those that are most relevant to this invention involve the "PxxP" proline
rich motif found in certain protein kinases (i.e., human PAK1) and the SH3 domain of the
adaptor molecule Nck (Galisteo, M.L. et al. (1996) J. Biol. Chem. 271:20997-21000).
Other regulatory interactions involving "PxxP" proline-rich motifs include the WW
domain (Sudol, M. (1996) Prog. Biophys. Mol. Bio. 65:113-132). Proline rich regions in
the proteins of the invention can be identified using these methods. They may be present
as sub-domains of the N-terminal, kinase, or C-terminal domains of the polypeptides of
the invention.
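
The proline-content criterion and the "PxxP" motif described above lend themselves to a simple scan. The sketch below is illustrative only: the 20-residue window is an assumed value (the text says only "a given amino acid length"), the 10% threshold is taken from the definition, and the function names are hypothetical rather than features of any named program.

```python
import re


def proline_rich_windows(seq: str, window: int = 20, threshold: float = 0.10):
    """Return (start, fraction) for each window whose proline fraction
    exceeds the threshold. Positions are 0-based."""
    seq = seq.upper()
    hits = []
    for start in range(len(seq) - window + 1):
        fraction = seq[start : start + window].count("P") / window
        if fraction > threshold:
            hits.append((start, fraction))
    return hits


def find_pxxp(seq: str):
    """Return 0-based start positions of every PxxP motif, including
    overlapping occurrences (hence the lookahead in the pattern)."""
    return [m.start() for m in re.finditer(r"(?=P..P)", seq.upper())]
```

A window with exactly 10% proline is not reported, since the definition requires content higher than the average.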

The term "spacer region" as used herein, refers to a region of the protein kinase
located between predicted functional domains. The spacer region has no detectable
homology to any amino acid sequence in the database, and can be identified by using a
Smith-Waterman alignment of the protein sequence against the non-redundant protein
database to define the C- and N-terminal boundaries of the flanking functional domains.
Spacer regions may or may not play a fundamental role in protein kinase function.
Precedent for the regulatory role of spacer regions in kinase function is provided by the
role of the src kinase spacer in inter-domain interactions (Xu, W. et al. (1997) Nature
385:595-602). Spacer regions in the proteins of the invention can be identified using these
methods. They may be present as sub-domains of the N-terminal, kinase, or C-terminal
domains of the polypeptides of the invention.

The term "insert" as used herein refers to a portion of a protein kinase that is
absent from a close homolog. Inserts may or may not be the product of alternative splicing
of exons. Inserts can be identified by using a Smith-Waterman sequence alignment of the
protein sequence against the non-redundant protein database, or by means of a multiple
sequence alignment of homologous sequences using the DNAStar program Megalign.
Inserts may play a functional role by presenting a new interface for protein-protein
interactions, or by interfering with such interactions. Insert regions in the proteins of the
invention can be identified using these methods. They may be present as sub-domains of
the N-terminal, kinase, or C-terminal domains of the polypeptides of the invention.

The term "C-terminal tail" as used herein, refers to a C-terminal domain of a
protein kinase, that by homology extends or protrudes past the C-terminal amino acid of
its closest homolog. C-terminal tails can be identified by using a Smith-Waterman
sequence alignment of the protein sequence against the non-redundant protein database, or
by means of a multiple sequence alignment of homologous sequences using the DNAStar
program Megalign. Depending on its length, a C-terminal tail may or may not play a
regulatory role in kinase function. C-terminal tail regions in the proteins of the invention
can be identified using these methods. They may be present as sub-domains of the
N-terminal, kinase, or C-terminal domains of the polypeptides of the invention.

Various low or high stringency hybridization conditions may be used depending
upon the specificity and selectivity desired. These conditions are well-known to those
skilled in the art. Under stringent hybridization conditions only highly complementary
nucleic acid sequences hybridize. Preferably, such conditions prevent hybridization of
nucleic acids having more than 1 or 2 mismatches out of 20 contiguous nucleotides; more
preferably, such conditions prevent hybridization of nucleic acids having more than 1 or 2
mismatches out of 50 contiguous nucleotides; most preferably, such conditions prevent
hybridization of nucleic acids having more than 1 or 2 mismatches out of 100 contiguous
nucleotides. In some instances, the conditions may prevent hybridization of nucleic acids
having more than 5 mismatches in the full-length sequence.
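
The mismatch criteria above can be made concrete by counting mismatches in every run of contiguous nucleotides. The following sketch assumes two equal-length, already aligned, ungapped sequences written in the same orientation (that is, the probe compared against the sequence it would perfectly pair with); it models only the counting rule stated in the text, not hybridization chemistry. The function names and default values are illustrative.

```python
def max_mismatches_in_window(a: str, b: str, window: int) -> int:
    """Largest number of mismatches found in any `window` contiguous
    positions of two aligned, equal-length sequences."""
    if len(a) != len(b):
        raise ValueError("sequences must be aligned and of equal length")
    mismatches = [x != y for x, y in zip(a.upper(), b.upper())]
    window = min(window, len(mismatches))  # short sequences form one window
    current = sum(mismatches[:window])
    worst = current
    # Slide the window one position at a time, updating the running count.
    for i in range(window, len(mismatches)):
        current += mismatches[i] - mismatches[i - window]
        worst = max(worst, current)
    return worst


def passes_stringency(a: str, b: str, window: int = 20, max_mismatches: int = 2) -> bool:
    """True if no window of contiguous nucleotides exceeds the mismatch limit."""
    return max_mismatches_in_window(a, b, window) <= max_mismatches
```

Calling `passes_stringency(a, b, window=50)` or `window=100` corresponds to the progressively stricter preferences listed above.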

By stringent hybridization assay conditions is meant hybridization assay conditions
at least as stringent as the following: hybridization in 50% formamide, 5X SSC, 50 mM
NaH2PO4, pH 6.8, 0.5% SDS, 0.1 mg/mL sonicated salmon sperm DNA, and 5X Denhardt's
solution at 42 °C overnight; washing with 2X SSC, 0.1% SDS at 45 °C; and washing with
0.2X SSC, 0.1% SDS at 45 °C. Under some of the most stringent hybridization assay
conditions, the second wash can be done with 0.1X SSC at a temperature up to 70 °C (pg.
421, Berger et al. (1987) Guide to Molecular Cloning Techniques, Meth. Enzym. vol. 152,
hereby incorporated by reference herein including any figures, tables, or drawings.).
However, other applications may require the use of conditions falling between these sets
of conditions. Methods of determining the conditions required to achieve desired
hybridizations are well-known to those with ordinary skill in the art, and are based on
several factors, including but not limited to, the sequences to be hybridized and the
samples to be tested.

In other preferred embodiments, the invention features isolated, enriched, or
purified nucleic acid molecules encoding kinase polypeptides, further comprising a vector
or promoter effective to initiate transcription in a host cell. The invention also features
recombinant nucleic acid, preferably in a cell or an organism. The recombinant nucleic
acid may contain a sequence selected from the group consisting of those set forth in
SEQ ID NO:1, SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6,
SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:11, SEQ ID NO:12,
SEQ ID NO:13, SEQ ID NO:14, SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:17, SEQ ID NO:18,
SEQ ID NO:19, SEQ ID NO:20, SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:23, SEQ ID NO:24,
SEQ ID NO:25, SEQ ID NO:26, SEQ ID NO:27, SEQ ID NO:28, SEQ ID NO:29, SEQ ID NO:30,
SEQ ID NO:31, SEQ ID NO:32, SEQ ID NO:33, SEQ ID NO:34, SEQ ID NO:35, SEQ ID NO:36,
SEQ ID NO:37, SEQ ID NO:38, SEQ ID NO:39, SEQ ID NO:40, SEQ ID NO:41, SEQ ID NO:42,
SEQ ID NO:43, SEQ ID NO:44, SEQ ID NO:45, SEQ ID NO:46, SEQ ID NO:47, SEQ ID NO:48,
SEQ ID NO:49, SEQ ID NO:50, SEQ ID NO:51, SEQ ID NO:52, SEQ ID NO:53, SEQ ID NO:54,
SEQ ID NO:55, SEQ ID NO:56, SEQ ID NO:57, SEQ ID NO:58, SEQ ID NO:59, SEQ ID NO:60,
SEQ ID NO:61, SEQ ID NO:62, SEQ ID NO:63, SEQ ID NO:64, SEQ ID NO:65, SEQ ID NO:66,
SEQ ID NO:67, SEQ ID NO:68, SEQ ID NO:69, SEQ ID NO:70, SEQ ID NO:71, SEQ ID NO:72,
SEQ ID NO:73, SEQ ID NO:74, SEQ ID NO:75, SEQ ID NO:76, SEQ ID NO:77, SEQ ID NO:78,
SEQ ID NO:79, SEQ ID NO:80, SEQ ID NO:81, SEQ ID NO:82, SEQ ID NO:83, SEQ ID NO:84,
SEQ ID NO:85, SEQ ID NO:86, SEQ ID NO:87, SEQ ID NO:88, SEQ ID NO:89, SEQ ID NO:90,
SEQ ID NO:91, SEQ ID NO:92, SEQ ID NO:93, SEQ ID NO:94, SEQ ID NO:95, SEQ ID NO:96,
SEQ ID NO:97, SEQ ID NO:98, SEQ ID NO:99, SEQ ID NO:100, SEQ ID NO:101, SEQ ID NO:102,
SEQ ID NO:103, SEQ ID NO:104, SEQ ID NO:105, SEQ ID NO:106, SEQ ID NO:107,
SEQ ID NO:108, SEQ ID NO:109, SEQ ID NO:110, SEQ ID NO:111, SEQ ID NO:112,
SEQ ID NO:113, SEQ ID NO:114, SEQ ID NO:115, SEQ ID NO:116, SEQ ID NO:117,
SEQ ID NO:118, SEQ ID NO:119, SEQ ID NO:120, and SEQ ID NO:121, or a functional
derivative thereof and a vector or a promoter effective to initiate transcription in a host
cell. The recombinant nucleic acid can alternatively contain a transcriptional initiation
region functional in a cell, a sequence complementary to an RNA sequence encoding a
kinase polypeptide and a transcriptional termination region functional in a cell. Specific
vectors and host cell combinations are discussed herein. The recombinant nucleic acid can
also contain the full-length sequence encoding the protein kinase, or a domain, for
example.

The term "vector" relates to a single or double-stranded circular nucleic acid
molecule that can be transfected into cells and replicated within or independently of a cell
genome. A circular double-stranded nucleic acid molecule can be cut and thereby
linearized upon treatment with restriction enzymes. An assortment of nucleic acid vectors,
restriction enzymes, and the knowledge of the nucleotide sequences cut by restriction
enzymes are readily available to those skilled in the art. A nucleic acid molecule encoding
a kinase can be inserted into a vector by cutting the vector with restriction enzymes and
ligating the two pieces together.
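
The linearization step described above can be illustrated with a short sketch. It models only the top strand of a circular molecule and uses the EcoRI recognition sequence GAATTC, which is cut after the first G, as an example enzyme; the choice of enzyme, the requirement of a single site, and the function name are assumptions for illustration, and overhangs on the second strand are ignored.

```python
def linearize(circular: str, site: str = "GAATTC", cut_offset: int = 1) -> str:
    """Cut a circular sequence at its single recognition site and
    return the resulting linear top-strand sequence.

    cut_offset is the number of bases into the site at which the
    strand is cut (1 for EcoRI: G^AATTC).
    """
    seq = circular.upper()
    # Append the first few bases so a site spanning the origin is found.
    doubled = seq + seq[: len(site) - 1]
    positions = [i for i in range(len(seq)) if doubled.startswith(site, i)]
    if len(positions) != 1:
        raise ValueError(f"expected exactly one site, found {len(positions)}")
    cut = (positions[0] + cut_offset) % len(seq)
    return seq[cut:] + seq[:cut]
```

The linear product begins with the bases immediately after the cut, which is where an insert cut with the same enzyme could be joined by ligation.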

The term "transfecting" defines a number of methods to insert a nucleic acid vector 
or other nucleic acid molecules into a cellular organism. These methods involve a variety 
of techniques, such as treating the cells with high concentrations of salt, an electric field,
detergent, or DMSO to render the outer membrane or wall of the cells permeable to
nucleic acid molecules of interest or use of various viral transduction strategies. 

The term "promoter" as used herein, refers to a nucleic acid sequence needed for
gene sequence expression. Promoter regions vary from organism to organism, but are well
known to persons skilled in the art for different organisms. For example, in prokaryotes,
the promoter region contains both the promoter (which directs the initiation of RNA
transcription) as well as the DNA sequences which, when transcribed into RNA, will
signal synthesis initiation. Such regions will normally include those 5'-non-coding
sequences involved with initiation of transcription and translation, such as the TATA box,
capping sequence, CAAT sequence, and the like.

In preferred embodiments, the isolated nucleic acid comprises, consists essentially
of, or consists of a nucleic acid sequence set forth in
SEQ ID NO:1, SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6,
SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:11, SEQ ID NO:12,
SEQ ID NO:13, SEQ ID NO:14, SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:17, SEQ ID NO:18,
SEQ ID NO:19, SEQ ID NO:20, SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:23, SEQ ID NO:24,
SEQ ID NO:25, SEQ ID NO:26, SEQ ID NO:27, SEQ ID NO:28, SEQ ID NO:29, SEQ ID NO:30,
SEQ ID NO:31, SEQ ID NO:32, SEQ ID NO:33, SEQ ID NO:34, SEQ ID NO:35, SEQ ID NO:36,
SEQ ID NO:37, SEQ ID NO:38, SEQ ID NO:39, SEQ ID NO:40, SEQ ID NO:41, SEQ ID NO:42,
SEQ ID NO:43, SEQ ID NO:44, SEQ ID NO:45, SEQ ID NO:46, SEQ ID NO:47, SEQ ID NO:48,
SEQ ID NO:49, SEQ ID NO:50, SEQ ID NO:51, SEQ ID NO:52, SEQ ID NO:53, SEQ ID NO:54,
SEQ ID NO:55, SEQ ID NO:56, SEQ ID NO:57, SEQ ID NO:58, SEQ ID NO:59, SEQ ID NO:60,
SEQ ID NO:61, SEQ ID NO:62, SEQ ID NO:63, SEQ ID NO:64, SEQ ID NO:65, SEQ ID NO:66,
SEQ ID NO:67, SEQ ID NO:68, SEQ ID NO:69, SEQ ID NO:70, SEQ ID NO:71, SEQ ID NO:72,
SEQ ID NO:73, SEQ ID NO:74, SEQ ID NO:75, SEQ ID NO:76, SEQ ID NO:77, SEQ ID NO:78,
SEQ ID NO:79, SEQ ID NO:80, SEQ ID NO:81, SEQ ID NO:82, SEQ ID NO:83, SEQ ID NO:84,
SEQ ID NO:85, SEQ ID NO:86, SEQ ID NO:87, SEQ ID NO:88, SEQ ID NO:89, SEQ ID NO:90,
SEQ ID NO:91, SEQ ID NO:92, SEQ ID NO:93, SEQ ID NO:94, SEQ ID NO:95, SEQ ID NO:96,
SEQ ID NO:97, SEQ ID NO:98, SEQ ID NO:99, SEQ ID NO:100, SEQ ID NO:101, SEQ ID NO:102,
SEQ ID NO:103, SEQ ID NO:104, SEQ ID NO:105, SEQ ID NO:106, SEQ ID NO:107,
SEQ ID NO:108, SEQ ID NO:109, SEQ ID NO:110, SEQ ID NO:111, SEQ ID NO:112,
SEQ ID NO:113, SEQ ID NO:114, SEQ ID NO:115, SEQ ID NO:116, SEQ ID NO:117,
SEQ ID NO:118, SEQ ID NO:119, SEQ ID NO:120, and SEQ ID NO:121,
or the corresponding full-length sequence, encodes
an amino acid sequence selected from the group consisting of those set forth in SEQ ID 
NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ ID
NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131, SEQ ID
NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136, SEQ ID
NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141, SEQ ID
NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146, SEQ ID
NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151, SEQ ID
NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156, SEQ ID
NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161, SEQ ID
NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166, SEQ ID
NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171, SEQ ID
NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176, SEQ ID
NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181, SEQ ID
NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186, SEQ ID
NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191, SEQ ID
NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196, SEQ ID
NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201, SEQ ID
NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID
NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211, SEQ ID
NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID
NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID
NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID
NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231, SEQ ID
NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID
NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241, and SEQ
ID NO:242, or the corresponding full-length amino acid sequence, a functional derivative 
thereof; or at least 10, 20, 40, 50, 75, 100, 200, 300 or 500 contiguous amino acids of a 
sequence selected from the group consisting of those set forth in SEQ ID NO:122, SEQ ID 
NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ ID NO:127, SEQ ID
NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131, SEQ ID NO:132, SEQ ID
NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136, SEQ ID NO:137, SEQ ID
NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141, SEQ ID NO:142, SEQ ID
NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146, SEQ ID NO:147, SEQ ID
NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151, SEQ ID NO:152, SEQ ID
NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156, SEQ ID NO:157, SEQ ID
NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161, SEQ ID NO:162, SEQ ID
NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166, SEQ ID NO:167, SEQ ID
NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171, SEQ ID NO:172, SEQ ID
NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176, SEQ ID NO:177, SEQ ID
NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181, SEQ ID NO:182, SEQ ID
NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186, SEQ ID NO:187, SEQ ID
NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191, SEQ ID NO:199, SEQ ID
NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196, SEQ ID NO:197, SEQ ID
NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201, SEQ ID NO:202, SEQ ID
NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID NO:207, SEQ ID
NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211, SEQ ID NO:212, SEQ ID
NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID NO:217, SEQ ID
NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID
NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ID
NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231, SEQ ID NO:232, SEQ ID
NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID NO:237, SEQ ID
NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241, and SEQ ID NO:242, or the
corresponding full-length sequences or derivatives thereof. The nucleic acid may be
isolated from a natural source by cDNA cloning or by subtractive hybridization. The
natural source may be mammalian, preferably human, blood, semen, or tissue, and the
nucleic acid may be synthesized by the triester method or by using an automated DNA
synthesizer.

The term "mammal" refers preferably to such organisms as mice, rats, rabbits,
guinea pigs, sheep, and goats, more preferably to cats, dogs, monkeys, and apes, and most
preferably to humans.

In yet other preferred embodiments, the nucleic acid is a conserved or unique
region, for example those useful for: the design of hybridization probes to facilitate
identification and cloning of additional polypeptides, the design of PCR probes to
facilitate cloning of additional polypeptides, obtaining antibodies to polypeptide regions,
and designing antisense oligonucleotides.

By "conserved nucleic acid regions" are meant regions present on two or more
nucleic acids encoding a kinase polypeptide, to which a particular nucleic acid sequence
can hybridize under lower stringency conditions. Examples of lower stringency
conditions suitable for screening for nucleic acid encoding kinase polypeptides are
provided in Berger et al. (1987) Guide to Molecular Cloning Techniques, Meth. Enzym.
vol. 152, hereby incorporated by reference herein in its entirety, including any drawings,
figures, or tables. Preferably, conserved regions differ by no more than 5 out of 20
nucleotides, even more preferably 2 out of 20 nucleotides or most preferably 1 out of 20
nucleotides.

By "unique nucleic acid region" is meant a sequence present in a nucleic acid
coding for a kinase polypeptide that is not present in a sequence coding for any other
naturally occurring polypeptide. Such regions preferably encode 10 (preferably 25, more
preferably 50, most preferably 75) or more contiguous amino acids selected from the
group consisting of those set forth in SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124,



WO 00/73469    PCT/US00/14842



SEQ ID NO:125, SEQ ID NO:126, SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129,
SEQ ID NO:130, SEQ ID NO:131, SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134,
SEQ ID NO:135, SEQ ID NO:136, SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139,
SEQ ID NO:140, SEQ ID NO:141, SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144,
SEQ ID NO:145, SEQ ID NO:146, SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149,
SEQ ID NO:150, SEQ ID NO:151, SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154,
SEQ ID NO:155, SEQ ID NO:156, SEQ ID NO:157, SEQ ID NO:158, SEQ ID NO:159,
SEQ ID NO:160, SEQ ID NO:161, SEQ ID NO:162, SEQ ID NO:163, SEQ ID NO:164,
SEQ ID NO:165, SEQ ID NO:166, SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169,
SEQ ID NO:170, SEQ ID NO:171, SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174,
SEQ ID NO:175, SEQ ID NO:176, SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179,
SEQ ID NO:180, SEQ ID NO:181, SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184,
SEQ ID NO:185, SEQ ID NO:186, SEQ ID NO:187, SEQ ID NO:188, SEQ ID NO:189,
SEQ ID NO:190, SEQ ID NO:191, SEQ ID NO:199, SEQ ID NO:193, SEQ ID NO:194,
SEQ ID NO:195, SEQ ID NO:196, SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199,
SEQ ID NO:200, SEQ ID NO:201, SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204,
SEQ ID NO:205, SEQ ID NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209,
SEQ ID NO:210, SEQ ID NO:211, SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214,
SEQ ID NO:215, SEQ ID NO:216, SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219,
SEQ ID NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224,
SEQ ID NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229,
SEQ ID NO:230, SEQ ID NO:231, SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234,
SEQ ID NO:235, SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239,
SEQ ID NO:240, SEQ ID NO:241, and SEQ ID NO:242, or functional derivatives thereof.
In particular, a unique nucleic acid region is preferably of mammalian origin and
preferably human.

A second aspect of the invention features a nucleic acid probe for the detection of
nucleic acid encoding a kinase polypeptide in a sample, wherein said polypeptide is
selected from the group consisting of SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124,
SEQ ID NO:125, SEQ ID NO:126, SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129,
SEQ ID NO:130, SEQ ID NO:131, SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134,
SEQ ID NO:135, SEQ ID NO:136, SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139,
SEQ ID NO:140, SEQ ID NO:141, SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144,
SEQ ID NO:145, SEQ ID NO:146, SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149,
SEQ ID NO:150, SEQ ID NO:151, SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154,
SEQ ID NO:155, SEQ ID NO:156, SEQ ID NO:157, SEQ ID NO:158, SEQ ID NO:159,
SEQ ID NO:160, SEQ ID NO:161, SEQ ID NO:162, SEQ ID NO:163, SEQ ID NO:164,
SEQ ID NO:165, SEQ ID NO:166, SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169,
SEQ ID NO:170, SEQ ID NO:171, SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174,
SEQ ID NO:175, SEQ ID NO:176, SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179,
SEQ ID NO:180, SEQ ID NO:181, SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184,
SEQ ID NO:185, SEQ ID NO:186, SEQ ID NO:187, SEQ ID NO:188, SEQ ID NO:189,
SEQ ID NO:190, SEQ ID NO:191, SEQ ID NO:199, SEQ ID NO:193, SEQ ID NO:194,
SEQ ID NO:195, SEQ ID NO:196, SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199,
SEQ ID NO:200, SEQ ID NO:201, SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204,
SEQ ID NO:205, SEQ ID NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209,
SEQ ID NO:210, SEQ ID NO:211, SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214,
SEQ ID NO:215, SEQ ID NO:216, SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219,
SEQ ID NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224,
SEQ ID NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229,
SEQ ID NO:230, SEQ ID NO:231, SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234,
SEQ ID NO:235, SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239,
SEQ ID NO:240, SEQ ID NO:241, and SEQ ID NO:242. Preferably, the nucleic acid
probe encodes a kinase polypeptide that is a fragment of the protein encoded by an amino
acid sequence selected from the group consisting of those set forth in SEQ ID NO:122,
SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ ID NO:127,
SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131, SEQ ID NO:132,
SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136, SEQ ID NO:137,
SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141, SEQ ID NO:142,
SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146, SEQ ID NO:147,
SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151, SEQ ID NO:152,
SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156, SEQ ID NO:157,
SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161, SEQ ID NO:162,
SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166, SEQ ID NO:167,
SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171, SEQ ID NO:172,
SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176, SEQ ID NO:177,
SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181, SEQ ID NO:182,
SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186, SEQ ID NO:187,
SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191, SEQ ID NO:199,
SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196, SEQ ID NO:197,
SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201, SEQ ID NO:202,
SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID NO:207,
SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211, SEQ ID NO:212,
SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID NO:217,
SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID NO:222,
SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID NO:227,
SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231, SEQ ID NO:232,
SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID NO:237,
SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241, and SEQ ID

NO:242, or the corresponding full-length amino acid sequences. The nucleic acid probe 
contains a nucleotide base sequence that will hybridize to a sequence selected from the 
group consisting of those set forth in SEQ ID NO:1, SEQ ID NO:2, SEQ ID NO:3, SEQ
ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9,
SEQ ID NO:10, SEQ ID NO:11, SEQ ID NO:12, SEQ ID NO:13, SEQ ID NO:14, SEQ
ID NO:15, SEQ ID NO:16, SEQ ID NO:17, SEQ ID NO:18, SEQ ID NO:19, SEQ ID
NO:20, SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:23, SEQ ID NO:24, SEQ ID NO:25,
SEQ ID NO:26, SEQ ID NO:27, SEQ ID NO:28, SEQ ID NO:29, SEQ ID NO:30, SEQ
ID NO:31, SEQ ID NO:32, SEQ ID NO:33, SEQ ID NO:34, SEQ ID NO:35, SEQ ID
NO:36, SEQ ID NO:37, SEQ ID NO:38, SEQ ID NO:39, SEQ ID NO:40, SEQ ID NO:41,
SEQ ID NO:42, SEQ ID NO:43, SEQ ID NO:44, SEQ ID NO:45, SEQ ID NO:46, SEQ
ID NO:47, SEQ ID NO:48, SEQ ID NO:49, SEQ ID NO:50, SEQ ID NO:51, SEQ ID
NO:52, SEQ ID NO:53, SEQ ID NO:54, SEQ ID NO:55, SEQ ID NO:56, SEQ ID NO:57,
SEQ ID NO:58, SEQ ID NO:59, SEQ ID NO:60, SEQ ID NO:61, SEQ ID NO:62, SEQ
ID NO:63, SEQ ID NO:64, SEQ ID NO:65, SEQ ID NO:66, SEQ ID NO:67, SEQ ID
NO:68, SEQ ID NO:69, SEQ ID NO:70, SEQ ID NO:71, SEQ ID NO:72, SEQ ID NO:73,
SEQ ID NO:74, SEQ ID NO:75, SEQ ID NO:76, SEQ ID NO:77, SEQ ID NO:78, SEQ
ID NO:79, SEQ ID NO:80, SEQ ID NO:81, SEQ ID NO:82, SEQ ID NO:83, SEQ ID
NO:84, SEQ ID NO:85, SEQ ID NO:86, SEQ ID NO:87, SEQ ID NO:88, SEQ ID NO:89,
SEQ ID NO:90, SEQ ID NO:91, SEQ ID NO:92, SEQ ID NO:93, SEQ ID NO:94, SEQ
ID NO:95, SEQ ID NO:96, SEQ ID NO:97, SEQ ID NO:98, SEQ ID NO:99, SEQ ID
NO:100, SEQ ID NO:101, SEQ ID NO:102, SEQ ID NO:103, SEQ ID NO:104, SEQ ID
NO:105, SEQ ID NO:106, SEQ ID NO:107, SEQ ID NO:108, SEQ ID NO:109, SEQ ID
NO:110, SEQ ID NO:111, SEQ ID NO:112, SEQ ID NO:113, SEQ ID NO:114, SEQ ID
NO:115, SEQ ID NO:116, SEQ ID NO:117, SEQ ID NO:118, SEQ ID NO:119, SEQ ID
NO:120, and SEQ ID NO:121, or the corresponding full-length sequence, or a functional
derivative thereof.

In preferred embodiments, the nucleic acid probe hybridizes to nucleic acid 
encoding at least 6, 12, 75, 90, 105, 120, 150, 200, 250, 300 or 350 contiguous amino 
acids of a sequence selected from the group consisting of those set forth in SEQ ID 
NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ ID
NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131, SEQ ID
NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136, SEQ ID
NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141, SEQ ID
NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146, SEQ ID
NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151, SEQ ID
NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156, SEQ ID
NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161, SEQ ID
NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166, SEQ ID
NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171, SEQ ID
NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176, SEQ ID
NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181, SEQ ID
NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186, SEQ ID
NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191, SEQ ID
NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196, SEQ ID
NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201, SEQ ID
NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID
NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211, SEQ ID
NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID
NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID
NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID
NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231, SEQ ID
NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID
NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241, and SEQ
ID NO:242, or the corresponding full-length amino acid sequence, or functional
derivatives thereof.

Methods for using the probes include detecting the presence or amount of kinase 
RNA in a sample by contacting the sample with a nucleic acid probe under conditions
such that hybridization occurs and detecting the presence or amount of the probe bound to
kinase RNA. The nucleic acid duplex formed between the probe and a nucleic acid
sequence coding for a kinase polypeptide may be used in the identification of the sequence
of the nucleic acid detected (Nelson et al., in Nonisotopic DNA Probe Techniques,
Academic Press, San Diego, Kricka, ed., p. 275, 1992, hereby incorporated by reference
herein in its entirety, including any drawings, figures, or tables). Kits for performing such
methods may be constructed to include a container means having disposed therein a
nucleic acid probe.

In a third aspect, the invention describes a recombinant cell or tissue comprising a 
nucleic acid molecule encoding a kinase polypeptide selected from the group consisting of
SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126,
SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131,
SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136,
SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141,
SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146,
SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151,
SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156,
SEQ ID NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161,
SEQ ID NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166,
SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171,
SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176,
SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181,
SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186,
SEQ ID NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191,
SEQ ID NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196,
SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201,
SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206,
SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211,
SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216,
SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221,
SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226,
SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231,
SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236,
SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241,
and SEQ ID NO:242. In such cells, the nucleic acid may be under the control of the
genomic regulatory elements, or may be under the control of exogenous regulatory
elements including an exogenous promoter. By "exogenous" it is meant a promoter that is
not normally coupled in vivo transcriptionally to the coding sequence for the kinase
polypeptides.

The polypeptide is preferably a fragment of the protein encoded by an amino acid 
sequence selected from the group consisting of those set forth in SEQ ID NO:122, SEQ ID 
NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ ID NO:127, SEQ ID
NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131, SEQ ID NO:132, SEQ ID
NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136, SEQ ID NO:137, SEQ ID
NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141, SEQ ID NO:142, SEQ ID
NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146, SEQ ID NO:147, SEQ ID
NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151, SEQ ID NO:152, SEQ ID
NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156, SEQ ID NO:157, SEQ ID
NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161, SEQ ID NO:162, SEQ ID
NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166, SEQ ID NO:167, SEQ ID
NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171, SEQ ID NO:172, SEQ ID
NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176, SEQ ID NO:177, SEQ ID
NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181, SEQ ID NO:182, SEQ ID
NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186, SEQ ID NO:187, SEQ ID
NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191, SEQ ID NO:199, SEQ ID
NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196, SEQ ID NO:197, SEQ ID
NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201, SEQ ID NO:202, SEQ ID
NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID NO:207, SEQ ID
NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211, SEQ ID NO:212, SEQ ID
NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID NO:217, SEQ ID
NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID
NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ID
NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231, SEQ ID NO:232, SEQ ID
NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID NO:237, SEQ ID
NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241, and SEQ ID NO:242, or the
corresponding full-length amino acid sequence. By "fragment" is meant an amino acid
sequence present in a kinase polypeptide. Preferably, such a sequence comprises at least
10, 20, 40, 50, 75, 100, 200, or 300 contiguous amino acids of a sequence selected from the
group consisting of those set forth in SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124,
SEQ ID NO:125, SEQ ID NO:126, SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129,
SEQ ID NO:130, SEQ ID NO:131, SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134,
SEQ ID NO:135, SEQ ID NO:136, SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139,
SEQ ID NO:140, SEQ ID NO:141, SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144,
SEQ ID NO:145, SEQ ID NO:146, SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149,
SEQ ID NO:150, SEQ ID NO:151, SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154,
SEQ ID NO:155, SEQ ID NO:156, SEQ ID NO:157, SEQ ID NO:158, SEQ ID NO:159,
SEQ ID NO:160, SEQ ID NO:161, SEQ ID NO:162, SEQ ID NO:163, SEQ ID NO:164,
SEQ ID NO:165, SEQ ID NO:166, SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169,
SEQ ID NO:170, SEQ ID NO:171, SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174,
SEQ ID NO:175, SEQ ID NO:176, SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179,
SEQ ID NO:180, SEQ ID NO:181, SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184,
SEQ ID NO:185, SEQ ID NO:186, SEQ ID NO:187, SEQ ID NO:188, SEQ ID NO:189,
SEQ ID NO:190, SEQ ID NO:191, SEQ ID NO:199, SEQ ID NO:193, SEQ ID NO:194,
SEQ ID NO:195, SEQ ID NO:196, SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199,
SEQ ID NO:200, SEQ ID NO:201, SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204,
SEQ ID NO:205, SEQ ID NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209,
SEQ ID NO:210, SEQ ID NO:211, SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214,
SEQ ID NO:215, SEQ ID NO:216, SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219,
SEQ ID NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224,
SEQ ID NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229,
SEQ ID NO:230, SEQ ID NO:231, SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234,
SEQ ID NO:235, SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239,
SEQ ID NO:240, SEQ ID NO:241, and SEQ ID NO:242, or of the corresponding
full-length amino acid sequence, or a functional derivative thereof.

In a fourth aspect, the invention features an isolated, enriched, or purified kinase 
polypeptide selected from the group consisting of SEQ ID NO:122, SEQ ID NO:123, SEQ 
ID NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ ID NO:127, SEQ ID NO:128, SEQ
ID NO:129, SEQ ID NO:130, SEQ ID NO:131, SEQ ID NO:132, SEQ ID NO:133, SEQ
ID NO:134, SEQ ID NO:135, SEQ ID NO:136, SEQ ID NO:137, SEQ ID NO:138, SEQ
ID NO:139, SEQ ID NO:140, SEQ ID NO:141, SEQ ID NO:142, SEQ ID NO:143, SEQ
ID NO:144, SEQ ID NO:145, SEQ ID NO:146, SEQ ID NO:147, SEQ ID NO:148, SEQ
ID NO:149, SEQ ID NO:150, SEQ ID NO:151, SEQ ID NO:152, SEQ ID NO:153, SEQ
ID NO:154, SEQ ID NO:155, SEQ ID NO:156, SEQ ID NO:157, SEQ ID NO:158, SEQ
ID NO:159, SEQ ID NO:160, SEQ ID NO:161, SEQ ID NO:162, SEQ ID NO:163, SEQ
ID NO:164, SEQ ID NO:165, SEQ ID NO:166, SEQ ID NO:167, SEQ ID NO:168, SEQ
ID NO:169, SEQ ID NO:170, SEQ ID NO:171, SEQ ID NO:172, SEQ ID NO:173, SEQ
ID NO:174, SEQ ID NO:175, SEQ ID NO:176, SEQ ID NO:177, SEQ ID NO:178, SEQ
ID NO:179, SEQ ID NO:180, SEQ ID NO:181, SEQ ID NO:182, SEQ ID NO:183, SEQ
ID NO:184, SEQ ID NO:185, SEQ ID NO:186, SEQ ID NO:187, SEQ ID NO:188, SEQ
ID NO:189, SEQ ID NO:190, SEQ ID NO:191, SEQ ID NO:199, SEQ ID NO:193, SEQ
ID NO:194, SEQ ID NO:195, SEQ ID NO:196, SEQ ID NO:197, SEQ ID NO:198, SEQ
ID NO:199, SEQ ID NO:200, SEQ ID NO:201, SEQ ID NO:202, SEQ ID NO:203, SEQ
ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ
ID NO:209, SEQ ID NO:210, SEQ ID NO:211, SEQ ID NO:212, SEQ ID NO:213, SEQ
ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID NO:217, SEQ ID NO:218, SEQ
ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID NO:223, SEQ
ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ID NO:228, SEQ
ID NO:229, SEQ ID NO:230, SEQ ID NO:231, SEQ ID NO:232, SEQ ID NO:233, SEQ
ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ
ID NO:239, SEQ ID NO:240, SEQ ID NO:241, and SEQ ID NO:242.

By "isolated" in reference to a polypeptide is meant a polymer of amino acids (2 or 
more amino acids) conjugated to each other, including polypeptides that are isolated from
a natural source or that are synthesized. The isolated polypeptides of the present invention
are unique in the sense that they are not found in a pure or separated state in nature. Use
of the term "isolated" indicates that a naturally occurring sequence has been removed from
its normal cellular environment. Thus, the sequence may be in a cell-free solution or
placed in a different cellular environment. The term does not imply that the sequence is
the only amino acid chain present, but that it is essentially free (about 90 - 95% pure at
least) of non-amino acid material naturally associated with it.

By the use of the term "enriched" in reference to a polypeptide is meant that the 
specific amino acid sequence constitutes a significantly higher fraction (2-5 fold) of the
total amino acid sequences present in the cells or solution of interest than in normal or
diseased cells or in the cells from which the sequence was taken. This could be caused by
a person by preferential reduction in the amount of other amino acid sequences present, or
by a preferential increase in the amount of the specific amino acid sequence of interest, or
by a combination of the two. However, it should be noted that enriched does not imply
that there are no other amino acid sequences present, just that the relative amount of the
sequence of interest has been significantly increased. The term significant here is used to
indicate that the level of increase is useful to the person making such an increase, and
generally means an increase relative to other amino acid sequences of about at least 2-fold,
more preferably at least 5- to 10-fold or even more. The term also does not imply that
there is no amino acid sequence from other sources. The other source of amino acid
sequences may, for example, comprise amino acid sequence encoded by a yeast or
bacterial genome, or a cloning vector such as pUC19. The term is meant to cover only
those situations in which man has intervened to increase the proportion of the desired
amino acid sequence.

It is also advantageous for some purposes that an amino acid sequence be in
purified form. The term "purified" in reference to a polypeptide does not require absolute
purity (such as a homogeneous preparation); instead, it represents an indication that the
sequence is relatively purer than in the natural environment. Compared to the natural level,
this level should be at least 2-5 fold greater (e.g., in terms of mg/mL). Purification of at
least one order of magnitude, preferably two or three orders, and more preferably four or
five orders of magnitude is expressly contemplated. The substance is preferably free of
contamination at a functionally significant level, for example 90%, 95%, or 99% pure.
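
As a rough illustration of the arithmetic behind these thresholds, the following sketch computes a fold-purification factor from two concentrations (in mg/mL) and expresses it in orders of magnitude. The function names, the default 2-fold minimum, and the example concentrations are hypothetical and are not taken from this specification.

```python
import math


def fold_purification(natural_mg_per_ml: float, purified_mg_per_ml: float) -> float:
    """Return the ratio of the purified concentration to the natural concentration."""
    if natural_mg_per_ml <= 0 or purified_mg_per_ml <= 0:
        raise ValueError("concentrations must be positive")
    return purified_mg_per_ml / natural_mg_per_ml


def orders_of_magnitude(fold: float) -> float:
    """Express a fold change as a base-10 logarithm (orders of magnitude)."""
    if fold <= 0:
        raise ValueError("fold change must be positive")
    return math.log10(fold)


def meets_minimum_fold(fold: float, minimum_fold: float = 2.0) -> bool:
    """Check a fold change against an assumed minimum (2-fold by default)."""
    return fold >= minimum_fold


# Hypothetical example: 0.5 mg/mL in the natural source, 50 mg/mL after purification.
fold = fold_purification(0.5, 50.0)
print(fold, orders_of_magnitude(fold), meets_minimum_fold(fold))
```

Under these assumed values the preparation would count as purified by two orders of magnitude, which falls in the "preferably two or three orders" range described above.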

In preferred embodiments, the kinase polypeptide is a fragment of the protein
encoded by an amino acid sequence selected from the group consisting of those set forth in
SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126,
SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131,
SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136,
SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141,
SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146,
SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151,
SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156,
SEQ ID NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161,
SEQ ID NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166,
SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171,
SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176,
SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181,
SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186,
SEQ ID NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191,
SEQ ID NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196,
SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201,
SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206,
SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211,
SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216,
SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221,
SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226,
SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231,
SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236,
SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241,
and SEQ ID NO:242, or the corresponding full-length amino acid sequences. Preferably,
the kinase polypeptide contains at least 10, 20, 40, 50, 75, 100, 200, or 300 contiguous
amino acids of a sequence selected from the group consisting of those set forth in SEQ ID
NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ ID
NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131, SEQ ID
NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136, SEQ ID
NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141, SEQ ID
NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146, SEQ ID
NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151, SEQ ID
NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156, SEQ ID
NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161, SEQ ID
NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166, SEQ ID
NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171, SEQ ID
NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176, SEQ ID
NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181, SEQ ID
NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186, SEQ ID
NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191, SEQ ID
NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196, SEQ ID
NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201, SEQ ID
NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID
NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211, SEQ ID
NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID
NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID
NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID
NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231, SEQ ID
NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID
NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241, and SEQ
ID NO:242, or the corresponding full-length amino acid sequence, or a functional
derivative thereof.

In preferred embodiments, the kinase polypeptide comprises an amino acid 
sequence having (a) an amino acid sequence selected from the group consisting of those
set forth in SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ
ID NO:126, SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ
ID NO:131, SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ
ID NO:136, SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ
ID NO:141, SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ
ID NO:146, SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ
ID NO:151, SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ
ID NO:156, SEQ ID NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ
ID NO:161, SEQ ID NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ
ID NO:166, SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ
ID NO:171, SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ
ID NO:176, SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ
ID NO:181, SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ
ID NO:186, SEQ ID NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ
ID NO:191, SEQ ID NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ
ID NO:196, SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ
ID NO:201, SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ
ID NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ
ID NO:211, SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ
ID NO:216, SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ
ID NO:221, SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ
ID NO:226, SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ
ID NO:231, SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ
ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ
ID NO:241, and SEQ ID NO:242; (b) an amino acid sequence selected from the group
consisting of those set forth in SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ
ID NO:125, SEQ ID NO:126, SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ
ID NO:130, SEQ ID NO:131, SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ
ID NO:135, SEQ ID NO:136, SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ
ID NO:140, SEQ ID NO:141, SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ
ID NO:145, SEQ ID NO:146, SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ
ID NO:150, SEQ ID NO:151, SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ
ID NO:155, SEQ ID NO:156, SEQ ID NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ
ID NO:160, SEQ ID NO:161, SEQ ID NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ
ID NO:165, SEQ ID NO:166, SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ
ID NO:170, SEQ ID NO:171, SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ
ID NO:175, SEQ ID NO:176, SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ
ID NO:180, SEQ ID NO:181, SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ
ID NO:185, SEQ ID NO:186, SEQ ID NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ
ID NO:190, SEQ ID NO:191, SEQ ID NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ
ID NO:195, SEQ ID NO:196, SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ
ID NO:200, SEQ ID NO:201, SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ
ID NO:205, SEQ ID NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ
ID NO:210, SEQ ID NO:211, SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ
ID NO:215, SEQ ID NO:216, SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ
ID NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ
ID NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ
ID NO:230, SEQ ID NO:231, SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ
ID NO:235, SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ
ID NO:240, SEQ ID NO:241, and SEQ ID NO:242, except that it lacks one or more, but
not all, of a domain selected from the group consisting of an N-terminal domain, a
catalytic domain, a C-terminal domain, a coiled-coil structure region, a proline-rich region,
a spacer region, an insert, and a C-terminal tail; (c) an amino acid sequence of a domain of
a polypeptide selected from the group consisting of those set forth in SEQ ID NO:122,
SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ ID NO:127,
SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131, SEQ ID NO:132,
SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136, SEQ ID NO:137,
SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141, SEQ ID NO:142,
SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146, SEQ ID NO:147,
SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151, SEQ ID NO:152,
SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156, SEQ ID NO:157,
SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161, SEQ ID NO:162,
SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166, SEQ ID NO:167,
SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171, SEQ ID NO:172,
SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176, SEQ ID NO:177,
SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181, SEQ ID NO:182,
SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186, SEQ ID NO:187,
SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191, SEQ ID NO:199,
SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196, SEQ ID NO:197,
SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201, SEQ ID NO:202,
SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID NO:207,
SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211, SEQ ID NO:212,
SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID NO:217,
SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID NO:222,
SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID NO:227,
SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231, SEQ ID NO:232,
SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID NO:237,
SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241, and SEQ ID
NO:242 where the domain is selected from the group consisting of an N-terminal domain,
a catalytic domain, a C-terminal domain, a coiled-coil structure region, a proline-rich
region, a spacer region, an insert, and a C-terminal tail; or (d) an amino acid sequence

selected from the group consisting of those set forth in SEQ ID NO:122, SEQ ID NO:123,
SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ ID NO:127, SEQ ID NO:128,
SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131, SEQ ID NO:132, SEQ ID NO:133,
SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136, SEQ ID NO:137, SEQ ID NO:138,
SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141, SEQ ID NO:142, SEQ ID NO:143,
SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146, SEQ ID NO:147, SEQ ID NO:148,
SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151, SEQ ID NO:152, SEQ ID NO:153,
SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156, SEQ ID NO:157, SEQ ID NO:158,
SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161, SEQ ID NO:162, SEQ ID NO:163,
SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166, SEQ ID NO:167, SEQ ID NO:168,
SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171, SEQ ID NO:172, SEQ ID NO:173,
SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176, SEQ ID NO:177, SEQ ID NO:178,
SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181, SEQ ID NO:182, SEQ ID NO:183,
SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186, SEQ ID NO:187, SEQ ID NO:188,
SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191, SEQ ID NO:199, SEQ ID NO:193,
SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196, SEQ ID NO:197, SEQ ID NO:198,
SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201, SEQ ID NO:202, SEQ ID NO:203,
SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID NO:207, SEQ ID NO:208,
SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211, SEQ ID NO:212, SEQ ID NO:213,
SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID NO:217, SEQ ID NO:218,
SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID NO:223,
SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ID NO:228,
SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231, SEQ ID NO:232, SEQ ID NO:233,
SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238,
SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241, and SEQ ID NO:242, except that it
lacks one or more, but not all, of the domains selected from the group consisting of a
C-terminal domain, a catalytic domain, an N-terminal domain, a spacer region, a proline-rich
region, a coiled-coil structure region, an insert, and a C-terminal tail. (The domain
demarcations of the polypeptides of the invention are indicated in Table 2 by reference to
the kinase domain.)

The polypeptide can be isolated from a natural source by methods well-known in
the art. The natural source may be mammalian, preferably human, blood, semen, or tissue,
and the polypeptide may be synthesized using an automated polypeptide synthesizer. The
isolated, enriched, or purified kinase polypeptide is preferably selected from the group
consisting of those set forth in SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ
ID NO:125, SEQ ID NO:126, SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ
ID NO:130, SEQ ID NO:131, SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ
ID NO:135, SEQ ID NO:136, SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ
ID NO:140, SEQ ID NO:141, SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ
ID NO:145, SEQ ID NO:146, SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ
ID NO:150, SEQ ID NO:151, SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ
ID NO:155, SEQ ID NO:156, SEQ ID NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ
ID NO:160, SEQ ID NO:161, SEQ ID NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ
ID NO:165, SEQ ID NO:166, SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ
ID NO:170, SEQ ID NO:171, SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ
ID NO:175, SEQ ID NO:176, SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ
ID NO:180, SEQ ID NO:181, SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ
ID NO:185, SEQ ID NO:186, SEQ ID NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ
ID NO:190, SEQ ID NO:191, SEQ ID NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ
ID NO:195, SEQ ID NO:196, SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ
ID NO:200, SEQ ID NO:201, SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ
ID NO:205, SEQ ID NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ
ID NO:210, SEQ ID NO:211, SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ
ID NO:215, SEQ ID NO:216, SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ
ID NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ
ID NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ
ID NO:230, SEQ ID NO:231, SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ
ID NO:235, SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ
ID NO:240, SEQ ID NO:241, and SEQ ID NO:242.

In some embodiments the invention includes a recombinant kinase polypeptide
selected from the group consisting of
SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126,
SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131,
SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136,
SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141,
SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146,
SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151,
SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156,
SEQ ID NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161,
SEQ ID NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166,
SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171,
SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176,
SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181,
SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186,
SEQ ID NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191,
SEQ ID NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196,
SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201,
SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206,
SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211,
SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216,
SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221,
SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226,
SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231,
SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236,
SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241,
and SEQ ID NO:242. By "recombinant kinase polypeptide" is meant a polypeptide
produced by recombinant DNA techniques such that it is distinct from a naturally
occurring polypeptide either in its location (e.g., present in a different cell or tissue
than found in nature), purity or structure. Generally, such a recombinant polypeptide
will be present in a cell in an amount different from that normally observed in nature.

In a fifth aspect, the invention features an antibody (e.g., a monoclonal or
polyclonal antibody) having specific binding affinity to a kinase polypeptide or a kinase
polypeptide domain or fragment where the polypeptide is selected from the group
consisting of
SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126,
SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131,
SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136,
SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141,
SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146,
SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151,
SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156,
SEQ ID NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161,
SEQ ID NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166,
SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171,
SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176,
SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181,
SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186,
SEQ ID NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191,
SEQ ID NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196,
SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201,
SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206,
SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211,
SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216,
SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221,
SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226,
SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231,
SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236,
SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241,
and SEQ ID NO:242. In preferred embodiments, the antibody binds
specifically to domains of kinase polypeptides that are defined supra.

By "specific binding affinity" is meant that the antibody binds to the target kinase
polypeptide with greater affinity than it binds to other polypeptides under specified
conditions. Antibodies or antibody fragments are polypeptides that contain regions that
can bind other polypeptides. The term "specific binding affinity" describes an antibody
that binds to a kinase polypeptide with greater affinity than it binds to other polypeptides
under specified conditions.
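
By way of illustration only, the comparison underlying "specific binding affinity" can be sketched in a few lines of Python. The sketch assumes that affinity is expressed as an equilibrium dissociation constant (Kd, where a lower value means tighter binding) measured under the same specified conditions for the target kinase polypeptide and for each comparator polypeptide. The text above does not prescribe how affinity is measured; the function name, the comparator names, and the `min_fold` margin are hypothetical.

```python
from typing import Mapping


def has_specific_binding_affinity(
    kd_target_nm: float,
    kd_others_nm: Mapping[str, float],
    min_fold: float = 1.0,
) -> bool:
    """Return True if the antibody binds the target more tightly than
    every comparator polypeptide.

    Assumption: affinities are given as dissociation constants (nM),
    so a LOWER Kd means a GREATER affinity. `min_fold` is a
    hypothetical margin; 1.0 means "any greater affinity counts".
    """
    if not kd_others_nm:
        raise ValueError("at least one comparator polypeptide is required")
    if kd_target_nm <= 0 or any(kd <= 0 for kd in kd_others_nm.values()):
        raise ValueError("dissociation constants must be positive")
    # The antibody is specific if each comparator Kd exceeds the target
    # Kd by more than the chosen fold margin.
    return all(kd / kd_target_nm > min_fold for kd in kd_others_nm.values())


# Hypothetical values: tight binding to the target, weak binding elsewhere.
print(has_specific_binding_affinity(1.0, {"albumin": 500.0, "other kinase": 80.0}))
```

A margin greater than 1.0 (for example, 10-fold) would make the test stricter; the choice of margin, like the choice of comparators, is left open by the definition above.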

The term "polyclonal" refers to antibodies that are heterogenous populations of
antibody molecules derived from the sera of animals immunized with an antigen or an
antigenic functional derivative thereof. For the production of polyclonal antibodies,
various host animals may be immunized by injection with the antigen. Various adjuvants
may be used to increase the immunological response, depending on the host species.

"Monoclonal antibodies" are substantially homogenous populations of antibodies
to a particular antigen. They may be obtained by any technique which provides for the
production of antibody molecules by continuous cell lines in culture. Monoclonal
antibodies may be obtained by methods known to those skilled in the art (Kohler et al.,
Nature 256:495-497, 1975, and U.S. Patent No. 4,376,110, both of which are hereby
incorporated by reference herein in their entirety including any figures, tables, or
drawings).

The term "antibody fragment" refers to a portion of an antibody, often the
hypervariable region and portions of the surrounding heavy and light chains, that displays
specific binding affinity for a particular molecule. A hypervariable region is a portion of
an antibody that physically binds to the polypeptide target.

Antibodies or antibody fragments having specific binding affinity to a kinase
polypeptide or domains of a kinase polypeptide of the invention may be used in methods
for detecting the presence and/or amount of kinase polypeptide in a sample by probing the
sample with the antibody under conditions suitable for kinase-antibody immunocomplex
formation and detecting the presence and/or amount of the antibody conjugated to the
kinase polypeptide. Diagnostic kits for performing such methods may be constructed to
include antibodies or antibody fragments specific for the kinase as well as a conjugate of a
binding partner of the antibodies or the antibodies themselves.

An antibody or antibody fragment with specific binding affinity to a kinase
polypeptide of the invention can be isolated, enriched, or purified from a prokaryotic or
eukaryotic organism. Routine methods known to those skilled in the art enable production
of antibodies or antibody fragments, in both prokaryotic and eukaryotic organisms.
Purification, enrichment, and isolation of antibodies, which are polypeptide molecules, are
described above.

Antibodies having specific binding affinity to a kinase polypeptide of the invention
may be used in methods for detecting the presence and/or amount of kinase polypeptide in
a sample by contacting the sample with the antibody under conditions such that an
immunocomplex forms and detecting the presence and/or amount of the antibody
conjugated to the kinase polypeptide. Diagnostic kits for performing such methods may
be constructed to include a first container containing the antibody and a second container
having a conjugate of a binding partner of the antibody and a label, such as, for example, a
radioisotope. The diagnostic kit may also include notification of an FDA approved use
and instructions therefor.

In a sixth aspect, the invention features a hybridoma which produces an antibody
having specific binding affinity to a kinase polypeptide or a kinase polypeptide domain,
where the polypeptide is selected from the group consisting of
SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126,
SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131,
SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136,
SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141,
SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146,
SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151,
SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156,
SEQ ID NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161,
SEQ ID NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166,
SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171,
SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176,
SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181,
SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186,
SEQ ID NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191,
SEQ ID NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196,
SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201,
SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206,
SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211,
SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216,
SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221,
SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226,
SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231,
SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236,
SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241,
and SEQ ID NO:242; and where the domains are defined as above. By "hybridoma" is
meant an immortalized cell line that is capable of secreting an antibody, for example an
antibody to a kinase of the invention. In preferred embodiments, the antibody to the
kinase comprises a sequence of amino acids that is able to specifically bind a kinase
polypeptide of the invention.

In a seventh aspect, the invention features a kinase polypeptide binding agent able
to bind to a kinase polypeptide selected from the group consisting of
SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126,
SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131,
SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136,
SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141,
SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146,
SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151,
SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156,
SEQ ID NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161,
SEQ ID NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166,
SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171,
SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176,
SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181,
SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186,
SEQ ID NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191,
SEQ ID NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196,
SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201,
SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206,
SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211,
SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216,
SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221,
SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226,
SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231,
SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236,
SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241,
and SEQ ID NO:242. The binding agent is preferably a purified antibody that recognizes
an epitope present on a kinase polypeptide of the invention. Other binding agents include
molecules that bind to kinase polypeptides and analogous molecules that bind to a kinase
polypeptide. Such binding agents may be identified by using assays that measure kinase
binding partner activity, such as those that measure PDGFR activity.

The invention also features a method for screening for human cells containing a
kinase polypeptide of the invention or an equivalent sequence. The method involves
identifying the novel polypeptide in human cells using techniques that are routine and
standard in the art, such as those described herein for identifying the kinases of the
invention (e.g., cloning, Southern or Northern blot analysis, in situ hybridization, PCR
amplification, etc.).

In an eighth aspect, the invention features methods for identifying a substance that
modulates kinase activity comprising the steps of: (a) contacting a kinase polypeptide
selected from the group consisting of
SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126,
SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131,
SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136,
SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141,
SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146,
SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151,
SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156,
SEQ ID NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161,
SEQ ID NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166,
SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171,
SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176,
SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181,
SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186,
SEQ ID NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191,
SEQ ID NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196,
SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201,
SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206,
SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211,
SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216,
SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221,
SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226,
SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231,
SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236,
SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241,
and SEQ ID NO:242 with a test substance; (b) measuring the activity of said
polypeptide; and (c) determining whether said substance modulates the activity of said
polypeptide.
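
Steps (b) and (c) can be illustrated, purely as a sketch, by comparing the measured activity of the polypeptide with and without the test substance. The Python fragment below assumes replicate activity readings in arbitrary units from a vehicle-only control and from the substance-treated sample; the function name, the 20% cut-off, and the example readings are hypothetical and are not taken from the specification, which does not fix any particular assay or threshold.

```python
from statistics import mean
from typing import Sequence


def classify_modulator(
    control_activity: Sequence[float],
    treated_activity: Sequence[float],
    threshold: float = 0.2,
) -> str:
    """Classify a test substance from kinase activity readings.

    Assumptions (illustrative only): activity is in arbitrary units,
    and a relative change of at least `threshold` (20% by default)
    from the control mean is treated as modulation.
    """
    baseline = mean(control_activity)
    if baseline <= 0:
        raise ValueError("control activity must be positive")
    # Relative change of the treated sample versus the control.
    change = (mean(treated_activity) - baseline) / baseline
    if change >= threshold:
        return "activates"
    if change <= -threshold:
        return "inhibits"
    return "no modulation detected"


# Hypothetical readings: step (b) gives the numbers, step (c) the call.
print(classify_modulator([100, 110, 90], [40, 50, 45]))
```

In practice a statistical test across replicates and a dose-response series would normally replace the fixed cut-off used here.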

The term "modulates" refers to the ability of a compound to alter the function of a
kinase of the invention. A modulator preferably activates or inhibits the activity of a
kinase of the invention.

The term "activates" refers to increasing the cellular activity of the kinase. The
term "inhibit" refers to decreasing the cellular activity of the kinase. Kinase activity is
preferably the interaction with a natural binding partner.

The term "modulates" also refers to altering the function of kinases of the
invention by increasing or decreasing the probability that a complex forms between the
kinase and a natural binding partner. A modulator preferably increases the probability that
such a complex forms between the kinase and the natural binding partner, more preferably
increases or decreases the probability that a complex forms between the kinase and the
natural binding partner depending on the concentration of the compound exposed to the
kinase, and most preferably decreases the probability that a complex forms between the
kinase and the natural binding partner.

The term "complex" refers to an assembly of at least two molecules bound to one
another. Signal transduction complexes often contain at least two protein molecules
bound to one another. For instance, a protein tyrosine receptor protein kinase, GRB2,
SOS, RAF, and RAS assemble to form a signal transduction complex in response to a
mitogenic ligand.

The term "natural binding partner" refers to polypeptides, lipids, small molecules,
or nucleic acids that bind to kinases in cells. A change in the interaction between a kinase
and a natural binding partner can manifest itself as an increased or decreased probability
that the interaction forms, or an increased or decreased concentration of kinase/natural
binding partner complex.
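
As an illustrative aside, the notion of an "increased or decreased concentration of kinase/natural binding partner complex" can be made concrete with a simple equilibrium calculation. The Python sketch below assumes reversible 1:1 binding between the kinase and a single natural binding partner, characterized by a dissociation constant Kd; this binding model, the function name, and the numerical values are assumptions introduced for illustration and are not stated in the specification.

```python
import math


def complex_concentration(kinase_total: float, partner_total: float, kd: float) -> float:
    """Equilibrium concentration of a 1:1 kinase/partner complex.

    Solves Kd = (Kt - C) * (Pt - C) / C for C, taking the physically
    meaningful (smaller) root of the quadratic. All quantities must be
    in the same concentration units. Assumes simple reversible 1:1
    binding, which is an illustrative simplification.
    """
    if kinase_total < 0 or partner_total < 0 or kd < 0:
        raise ValueError("concentrations and Kd must be non-negative")
    s = kinase_total + partner_total + kd
    return (s - math.sqrt(s * s - 4.0 * kinase_total * partner_total)) / 2.0


# Hypothetical example: a compound that weakens the interaction
# (higher apparent Kd) lowers the amount of complex formed.
before = complex_concentration(10.0, 10.0, kd=10.0)
after = complex_concentration(10.0, 10.0, kd=100.0)
print(f"complex before: {before:.2f}, after: {after:.2f}")
```

Under this model a modulator that raises the apparent Kd decreases the complex concentration, and one that lowers it increases the complex concentration.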

The term "contacting" as used herein refers to mixing a solution comprising the
test compound with a liquid medium bathing the cells of the methods. The solution
comprising the compound may also comprise another component, such as dimethyl
sulfoxide (DMSO), which facilitates the uptake of the test compound or compounds into
the cells of the methods. The solution comprising the test compound may be added to the
medium bathing the cells by utilizing a delivery apparatus, such as a pipet-based device or
syringe-based device.

In a ninth aspect, the invention features methods for identifying a substance that
modulates kinase activity in a cell comprising the steps of: (a) expressing a kinase
polypeptide in a cell, wherein said polypeptide is selected from the group consisting of
SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126,
SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131,
SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136,
SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141,
SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146,
SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151,
SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156,
SEQ ID NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161,
SEQ ID NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166,
SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171,






SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176,
SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181,
SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186,
SEQ ID NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191,
SEQ ID NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196,
SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201,
SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206,
SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211,
SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216,
SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221,
SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226,
SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231,
SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236,
SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241,
and SEQ ID NO:242; (b) adding a test substance to said cell; and (c) monitoring a change
in cell phenotype or the interaction between said polypeptide and a natural binding
partner.

The term "expressing" as used herein refers to the production of kinases of the
invention from a nucleic acid vector containing kinase genes within a cell. The nucleic
acid vector is transfected into cells using techniques well known in the art, as described
herein.

In a tenth aspect, the invention provides methods for treating a disease or abnormal
condition by administering to a patient in need of such treatment a substance that
modulates the activity of a polypeptide selected from the group consisting of
SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126,
SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131,
SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136,
SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141,
SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146,
SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151,
SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156,
SEQ ID NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161,
SEQ ID NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166,
SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171,
SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176,
SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181,
SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186,
SEQ ID NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191,
SEQ ID NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196,
SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201,
SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206,
SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211,
SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216,
SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221,
SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226,
SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231,
SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236,
SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241,
and SEQ ID NO:242. Preferably, the disease is selected from the group consisting of
immune-related diseases and disorders, cardiovascular disease, neurodegenerative
disorders, and cancer. Also included are metabolic disorders, such as diabetes mellitus,
and reproductive disorders, such as infertility.

Preferably, the disease or disorder is selected from the group consisting of
rheumatoid arthritis, atherosclerosis, autoimmune disorders, and organ transplantation.
Preferably the disease or disorder is selected from the group consisting of immune-related
diseases and disorders, myocardial infarction, cardiomyopathies, stroke, renal failure, and
oxidative stress-related neurodegenerative disorders. Most preferably, the immune-related
diseases and disorders are selected from the group consisting of rheumatoid arthritis,
chronic inflammatory bowel disease, chronic inflammatory pelvic disease, multiple
sclerosis, asthma, osteoarthritis, psoriasis, atherosclerosis, rhinitis, autoimmunity, and
organ transplantation.

Substances useful for treatment of disorders or diseases preferably show positive
results in one or more in vitro assays for an activity corresponding to treatment of the
disease or disorder in question. Substances that modulate the activity of the polypeptides
preferably include, but are not limited to, antisense oligonucleotides and inhibitors of
protein kinases.

The term "preventing" refers to decreasing the probability that an organism
contracts or develops an abnormal condition.

The term "treating" refers to having a therapeutic effect and at least partially
alleviating or abrogating an abnormal condition in the organism.

The term "therapeutic effect" refers to the inhibition or activation of factors causing
or contributing to the abnormal condition. A therapeutic effect relieves to some extent one
or more of the symptoms of the abnormal condition. In reference to the treatment of
abnormal conditions, a therapeutic effect can refer to one or more of the following: (a) an
increase in the proliferation, growth, and/or differentiation of cells; (b) inhibition (i.e.,
slowing or stopping) of cell death; (c) inhibition of degeneration; (d) relieving to some
extent one or more of the symptoms associated with the abnormal condition; and (e)
enhancing the function of the affected population of cells. Compounds demonstrating
efficacy against abnormal conditions can be identified as described herein.

The term "abnormal condition" refers to a function in the cells or tissues of an
organism that deviates from their normal functions in that organism. An abnormal
condition can relate to cell proliferation, cell differentiation or cell survival. An abnormal
condition may also include irregularities in cell cycle progression, i.e., irregularities in
normal cell cycle progression through mitosis and meiosis.

Abnormal cell proliferative conditions include cancers such as fibrotic and
mesangial disorders, abnormal angiogenesis and vasculogenesis, wound healing, psoriasis,
diabetes mellitus, and inflammation.

Abnormal differentiation conditions include, but are not limited to,
neurodegenerative disorders, slow wound healing rates, and slow tissue grafting healing
rates.

Abnormal cell survival conditions relate to conditions in which programmed cell
death (apoptosis) pathways are activated or abrogated. A number of protein kinases are
associated with the apoptosis pathways. Aberrations in the function of any one of the
protein kinases could lead to cell immortality or premature cell death.

The term "aberration", in conjunction with the function of a kinase in a signal
transduction process, refers to a kinase that is over- or under-expressed in an organism,
mutated such that its catalytic activity is lower or higher than wild-type protein kinase
activity, mutated such that it can no longer interact with a natural binding partner, is no
longer modified by another protein kinase or protein phosphatase, or no longer interacts
with a natural binding partner.

The term "administering" relates to a method of incorporating a compound into
cells or tissues of an organism. The abnormal condition can be prevented or treated when
the cells or tissues of the organism exist within the organism or outside of the organism.
Cells existing outside the organism can be maintained or grown in cell culture dishes. For
cells harbored within the organism, many techniques exist in the art to administer
compounds, including (but not limited to) oral, parenteral, dermal, injection, and aerosol
applications. For cells outside of the organism, multiple techniques exist in the art to
administer the compounds, including (but not limited to) cell microinjection techniques,
transformation techniques, and carrier techniques.

The abnormal condition can also be prevented or treated by administering a
compound to a group of cells having an aberration in a signal transduction pathway to an
organism. The effect of administering a compound on organism function can then be
monitored. The organism is preferably a mouse, rat, rabbit, guinea pig, or goat, more
preferably a monkey or ape, and most preferably a human.

In an eleventh aspect, the invention features methods for detecting the expression
of a polypeptide in a sample as a diagnostic tool for diseases or disorders, wherein the
method comprises the steps of: (a) contacting the sample with a nucleic acid probe which
hybridizes under hybridization assay conditions to a nucleic acid target region of a kinase
polypeptide selected from the group consisting of
SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126,
SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131,
SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136,
SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141,
SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146,
SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151,
SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156,
SEQ ID NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161,
SEQ ID NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166,
SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171,
SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176,
SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181,
SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186,
SEQ ID NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191,
SEQ ID NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196,
SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201,
SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206,
SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211,
SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216,
SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221,
SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226,
SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231,
SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236,
SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241,
and SEQ ID NO:242, said probe comprising the nucleic acid sequence encoding the
polypeptide, fragments thereof, and the complements of the sequences and fragments;
and (b) detecting the presence or amount of the probe:target region hybrid as an
indication of the disease.

In preferred embodiments of the invention, the disease or disorder is selected from
the group consisting of rheumatoid arthritis, atherosclerosis, autoimmune disorders, organ
transplantation, myocardial infarction, cardiomyopathies, stroke, renal failure, oxidative
stress-related neurodegenerative disorders, metabolic disorders including diabetes,
reproductive disorders including infertility, and cancer.

The kinase "target region" is a nucleotide base sequence selected from the group
consisting of those set forth in
SEQ ID NO:1, SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6,
SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:11, SEQ ID NO:12,
SEQ ID NO:13, SEQ ID NO:14, SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:17, SEQ ID NO:18,
SEQ ID NO:19, SEQ ID NO:20, SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:23, SEQ ID NO:24,
SEQ ID NO:25, SEQ ID NO:26, SEQ ID NO:27, SEQ ID NO:28, SEQ ID NO:29, SEQ ID NO:30,
SEQ ID NO:31, SEQ ID NO:32, SEQ ID NO:33, SEQ ID NO:34, SEQ ID NO:35, SEQ ID NO:36,
SEQ ID NO:37, SEQ ID NO:38, SEQ ID NO:39, SEQ ID NO:40, SEQ ID NO:41, SEQ ID NO:42,
SEQ ID NO:43, SEQ ID NO:44, SEQ ID NO:45, SEQ ID NO:46, SEQ ID NO:47, SEQ ID NO:48,
SEQ ID NO:49, SEQ ID NO:50, SEQ ID NO:51, SEQ ID NO:52, SEQ ID NO:53, SEQ ID NO:54,
SEQ ID NO:55, SEQ ID NO:56, SEQ ID NO:57, SEQ ID NO:58, SEQ ID NO:59, SEQ ID NO:60,
SEQ ID NO:61, SEQ ID NO:62, SEQ ID NO:63, SEQ ID NO:64, SEQ ID NO:65, SEQ ID NO:66,
SEQ ID NO:67, SEQ ID NO:68, SEQ ID NO:69, SEQ ID NO:70, SEQ ID NO:71, SEQ ID NO:72,
SEQ ID NO:73, SEQ ID NO:74, SEQ ID NO:75, SEQ ID NO:76, SEQ ID NO:77, SEQ ID NO:78,
SEQ ID NO:79, SEQ ID NO:80, SEQ ID NO:81, SEQ ID NO:82, SEQ ID NO:83, SEQ ID NO:84,
SEQ ID NO:85, SEQ ID NO:86, SEQ ID NO:87, SEQ ID NO:88, SEQ ID NO:89, SEQ ID NO:90,
SEQ ID NO:91, SEQ ID NO:92, SEQ ID NO:93, SEQ ID NO:94, SEQ ID NO:95, SEQ ID NO:96,
SEQ ID NO:97, SEQ ID NO:98, SEQ ID NO:99, SEQ ID NO:100, SEQ ID NO:101, SEQ ID NO:102,
SEQ ID NO:103, SEQ ID NO:104, SEQ ID NO:105, SEQ ID NO:106, SEQ ID NO:107, SEQ ID NO:108,
SEQ ID NO:109, SEQ ID NO:110, SEQ ID NO:111, SEQ ID NO:112, SEQ ID NO:113, SEQ ID NO:114,
SEQ ID NO:115, SEQ ID NO:116, SEQ ID NO:117, SEQ ID NO:118, SEQ ID NO:119, SEQ ID NO:120,
and SEQ ID NO:121, or the corresponding full-length sequences, a functional
derivative thereof, or a fragment thereof to which the nucleic acid probe will specifically
hybridize. Specific hybridization indicates that in the presence of other nucleic acids the
probe only hybridizes detectably with the target region of the kinase of the invention. Putative
target regions can be identified by methods well known in the art consisting of alignment
and comparison of the most closely related sequences in the database.

In preferred embodiments the nucleic acid probe hybridizes to a kinase target
region encoding at least 6, 12, 75, 90, 105, 120, 150, 200, 250, 300 or 350 contiguous
amino acids of the sequence set forth in
SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ ID NO:127,
SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131, SEQ ID NO:132, SEQ ID NO:133,
SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136, SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139,
SEQ ID NO:140, SEQ ID NO:141, SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145,
SEQ ID NO:146, SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151,
SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156, SEQ ID NO:157,
SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161, SEQ ID NO:162, SEQ ID NO:163,
SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166, SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169,
SEQ ID NO:170, SEQ ID NO:171, SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175,
SEQ ID NO:176, SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181,
SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186, SEQ ID NO:187,
SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191, SEQ ID NO:199, SEQ ID NO:193,
SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196, SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199,
SEQ ID NO:200, SEQ ID NO:201, SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205,
SEQ ID NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211,
SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID NO:217,
SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID NO:223,
SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229,
SEQ ID NO:230, SEQ ID NO:231, SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235,
SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241,
and SEQ ID NO:242, or the corresponding
full-length amino acid sequence, or a functional derivative thereof. Hybridization
conditions should be such that hybridization occurs only with the kinase genes in the
presence of other nucleic acid molecules. Under stringent hybridization conditions only
highly complementary nucleic acid sequences hybridize. Preferably, such conditions
prevent hybridization of nucleic acids having more than 1 or 2 mismatches out of 20
contiguous nucleotides. Such conditions are defined supra.

Hybridization conditions should be such that hybridization occurs only with the 
genes in the presence of other nucleic acid molecules. Under stringent hybridization 
conditions only highly complementary nucleic acid sequences hybridize. Preferably, such 
conditions prevent hybridization of nucleic acids having 1 or 2 mismatches out of 20 
contiguous nucleotides. Such conditions are defined supra. 
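
The mismatch criterion stated above can be illustrated with a short sketch. It is not part of the disclosed method: it assumes the probe and target sequences are already aligned, equal in length, and written in the same sense (that is, the target strand has already been converted to its complement for comparison). The function names and the default threshold of 2 mismatches are hypothetical choices made for illustration only.

```python
def max_window_mismatches(probe: str, target: str, window: int = 20) -> int:
    """Largest number of mismatches in any run of `window` contiguous positions.

    Assumes (for illustration) that both sequences are pre-aligned, gap-free,
    and of equal length.
    """
    if len(probe) != len(target):
        raise ValueError("sequences must be pre-aligned to equal length")
    if len(probe) < window:
        raise ValueError("sequences are shorter than the window")
    # 1 where the bases differ, 0 where they agree
    mismatches = [int(a != b) for a, b in zip(probe.upper(), target.upper())]
    current = sum(mismatches[:window])
    worst = current
    # slide the window one position at a time, updating the running count
    for i in range(window, len(mismatches)):
        current += mismatches[i] - mismatches[i - window]
        worst = max(worst, current)
    return worst


def passes_stringency(probe: str, target: str,
                      max_mismatches: int = 2, window: int = 20) -> bool:
    """True if no 20-nucleotide window exceeds the allowed mismatch count."""
    return max_window_mismatches(probe, target, window) <= max_mismatches
```

A pair of sequences that passes this check would, under the stated assumptions, be expected to hybridize under the stringent conditions described; actual hybridization behavior also depends on the experimental conditions defined supra.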

The diseases for which detection of kinase genes in a sample could be diagnostic
include diseases in which kinase nucleic acid (DNA and/or RNA) is amplified in
comparison to normal cells. By "amplification" is meant increased numbers of kinase
DNA or RNA in a cell compared with normal cells. In normal cells, kinases are typically
found as single copy genes. In selected diseases, the chromosomal location of the kinase
genes may be amplified, resulting in multiple copies of the gene, or amplification. Gene
amplification can lead to amplification of kinase RNA, or kinase RNA can be amplified in
the absence of kinase DNA amplification.

"Amplification" as it refers to RNA can be the detectable presence of kinase RNA
in cells, since in some normal cells there is no basal expression of kinase RNA. In other
normal cells, a basal level of expression of kinase exists; therefore, in these cases
amplification is the detection of at least 1-2-fold, and preferably more, kinase RNA,
compared to the basal level.
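
The two cases described above (no basal expression versus a measurable basal level) can be sketched as a simple decision rule. This is an illustrative sketch only: the function name, the use of arbitrary expression units, and the default `min_fold` of 2.0 are assumptions, chosen from the upper end of the "at least 1-2-fold" range given in the text.

```python
def is_amplified(sample_level: float, basal_level: float,
                 min_fold: float = 2.0) -> bool:
    """Decide whether kinase RNA is 'amplified' relative to normal cells.

    sample_level and basal_level are RNA measurements in the same arbitrary
    units (an assumption of this sketch). min_fold is a hypothetical cutoff.
    """
    if sample_level < 0 or basal_level < 0:
        raise ValueError("expression levels cannot be negative")
    if basal_level == 0:
        # No basal expression in normal cells: any detectable RNA counts.
        return sample_level > 0
    # Basal expression exists: require a fold increase over the basal level.
    return sample_level / basal_level >= min_fold
```

For example, `is_amplified(30.0, 10.0)` reflects a three-fold increase over a basal level, while `is_amplified(5.0, 0.0)` reflects detectable RNA where normal cells show none.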

The diseases that could be diagnosed by detection of kinase nucleic acid in a
sample preferably include cancers. The test samples suitable for nucleic acid probing
methods of the present invention include, for example, cells or nucleic acid extracts of
cells, or biological fluids. The samples used in the above-described methods will vary
based on the assay format, the detection method and the nature of the tissues, cells or
extracts to be assayed. Methods for preparing nucleic acid extracts of cells are well
known in the art and can be readily adapted in order to obtain a sample that is compatible
with the method utilized.

Another aspect of the invention involves a method of agonizing (stimulating) or
antagonizing a target of the invention and a natural binding partner associated activity in a
mammal comprising administering to said mammal an agonist or antagonist to one of the
above disclosed polypeptides in an amount sufficient to effect said agonism or
antagonism. A method of treating diseases in a mammal with an agonist or antagonist of
the activity of the protein of the present invention, comprising administering the agonist or
antagonist to a mammal in an amount sufficient to agonize or antagonize associated
functions, is also encompassed in the present application.

In an effort to discover novel treatments for diseases, biomedical researchers and
chemists have designed, synthesized, and tested molecules that inhibit the function of
protein polypeptides. Some small organic molecules form a class of compounds that
modulate the function of protein polypeptides. Examples of molecules that have been
reported to inhibit the function of protein kinases include, but are not limited to, bis
monocyclic, bicyclic or heterocyclic aryl compounds (PCT WO 92/20642, published
November 26, 1992 by Maguire et al.), vinylene-azaindole derivatives (PCT WO
94/14808, published July 7, 1994 by Ballinari et al.), 1-cyclopropyl-4-pyridyl-quinolones
(U.S. Patent No. 5,330,992), styryl compounds (U.S. Patent No. 5,217,999),
styryl-substituted pyridyl compounds (U.S. Patent No. 5,302,606), certain quinazoline
derivatives (EP Application No. 0 566 266 A1), selenoindoles and selenides (PCT WO
94/03427, published February 17, 1994 by Denny et al.), tricyclic polyhydroxylic
compounds (PCT WO 92/21660, published December 10, 1992 by Dow), and
benzylphosphonic acid compounds (PCT WO 91/15495, published October 17, 1991 by
Dow et al.), all of which are incorporated by reference herein, including any drawings.

Compounds that can traverse cell membranes and are resistant to acid hydrolysis
are potentially advantageous as therapeutics as they can become highly bioavailable after
being administered orally to patients. However, many of these protein inhibitors only
weakly inhibit function. In addition, many inhibit a variety of protein kinases and will
therefore cause multiple side-effects as therapeutics for diseases.

Some indolinone compounds, however, form classes of acid resistant and
membrane permeable organic molecules. WO 96/22976 (published August 1, 1996 by
Ballinari et al.) describes hydrosoluble indolinone compounds that harbor tetralin,
naphthalene, quinoline, and indole substituents fused to the oxindole ring. These bicyclic
substituents are in turn substituted with polar groups including hydroxylated alkyl,
phosphate, and ether substituents. U.S. Patent Application Serial Nos. 08/702,232, filed
August 23, 1996, entitled "Indolinone Combinatorial Libraries and Related Products and
Methods for the Treatment of Disease" by Tang et al. (Lyon & Lyon Docket No.
221/187) and 08/485,323, filed June 7, 1995, entitled "Benzylidene-Z-Indoline
Compounds for the Treatment of Disease" by Tang et al. (Lyon & Lyon Docket No.
223/298) and International Patent Publication WO 96/22976, published August 1, 1996 by
Ballinari et al., all of which are incorporated herein by reference in their entirety,
including any drawings, describe indolinone chemical libraries of indolinone compounds
harboring other bicyclic moieties as well as monocyclic moieties fused to the oxindole
ring. Applications 08/702,232, filed August 23, 1996, entitled "Indolinone Combinatorial
Libraries and Related Products and Methods for the Treatment of Disease" by Tang et al.
(Lyon & Lyon Docket No. 221/187), 08/485,323, filed June 7, 1995, entitled
"Benzylidene-Z-Indoline Compounds for the Treatment of Disease" by Tang et al. (Lyon
& Lyon Docket No. 223/298), and WO 96/22976, published August 1, 1996 by Ballinari
et al. teach methods of indolinone synthesis, methods of testing the biological activity of
indolinone compounds in cells, and inhibition patterns of indolinone derivatives, all of
which are incorporated by reference herein, including any drawings.

Other examples of substances capable of modulating kinase activity include, but
are not limited to, tyrphostins, quinazolines, quinoxolines, and quinolines. The
quinazolines, tyrphostins, quinolines, and quinoxolines referred to above include well
known compounds such as those described in the literature. For example, representative
publications describing quinazolines include Barker et al., EPO Publication No. 0 520 722
A1; Jones et al., U.S. Patent No. 4,447,608; Kabbe et al., U.S. Patent No. 4,757,072; Kaul
and Vougioukas, U.S. Patent No. 5,316,553; Kreighbaum and Comer, U.S. Patent No.
4,343,940; Pegg and Wardleworth, EPO Publication No. 0 562 734 A1; Barker et al., Proc.
of Am. Assoc. for Cancer Research 32:327 (1991); Bertino, J.R., Cancer Research 3:293-304
(1979); Bertino, J.R., Cancer Research 9(2 part 1):293-304 (1979); Curtin et al., Br. J.
Cancer 53:361-368 (1986); Fernandes et al., Cancer Research 43:1117-1123 (1983); Ferris
et al., J. Org. Chem. 44(2):173-178; Fry et al., Science 265:1093-1095 (1994); Jackman et
al., Cancer Research 51:5579-5586 (1981); Jones et al., J. Med. Chem. 29(6):1114-1118;
Lee and Skibo, Biochemistry 26(23):7355-7362 (1987); Lemus et al., J. Org. Chem.
54:3511-3518 (1989); Ley and Seng, Synthesis 1975:415-522 (1975); Maxwell et al.,
Magnetic Resonance in Medicine 17:189-196 (1991); Mini et al., Cancer Research
45:325-330 (1985); Phillips and Castle, J. Heterocyclic Chem. 17(19):1489-1596 (1980);
Reece et al., Cancer Research 47(11):2996-2999 (1977); Sculier et al., Cancer Immunol.
and Immunother. 23:A65 (1986); Sikora et al., Cancer Letters 23:289-295 (1984); and
Sikora et al., Analytical Biochem. 172:344-355 (1988), all of which are incorporated
herein by reference in their entirety, including any drawings.

Quinoxaline is described in Kaul and Vougioukas, U.S. Patent No. 5,316,553, 
incorporated herein by reference in its entirety, including any drawings. 

Quinolines are described in Dolle et al., J. Med. Chem. 37:2627-2629 (1994);
Maguire, J. Med. Chem. 37:2129-2131 (1994); Burke et al., J. Med. Chem. 36:425-432
(1993); and Burke et al., BioOrganic Med. Chem. Letters 2:1771-1774 (1992), all of which
are incorporated by reference in their entirety, including any drawings.

Tyrphostins are described in Allen et al., Clin. Exp. Immunol. 91:141-156 (1993);
Anafi et al., Blood 82:12:3524-3529 (1993); Baker et al., J. Cell Sci. 102:543-555 (1992);
Bilder et al., Amer. Physiol. Soc. pp. 6363-6143:C721-C730 (1991); Brunton et al.,
Proceedings of Amer. Assoc. Cancer Rsch. 33:558 (1992); Bryckaert et al., Experimental
Cell Research 199:255-261 (1992); Dong et al., J. Leukocyte Biology 53:53-60 (1993);
Dong et al., J. Immunol. 151(5):2717-2724 (1993); Gazit et al., J. Med. Chem.
32:2344-2352 (1989); Gazit et al., J. Med. Chem. 36:3556-3564 (1993); Kaur et al., Anti-Cancer
Drugs 5:213-222 (1994); Kaur et al., King et al., Biochem. J. 275:413-418 (1991); Kuo et
al., Cancer Letters 74:197-202 (1993); Levitzki, A., The FASEB J. 6:3275-3282 (1992);
Lyall et al., J. Biol. Chem. 264:14503-14509 (1989); Peterson et al., The Prostate
22:335-345 (1993); Pillemer et al., Int. J. Cancer 50:80-85 (1992); Posner et al., Molecular
Pharmacology 45:673-683 (1993); Rendu et al., Biol. Pharmacology 44(5):881-888
(1992); Sauro and Thomas, Life Sciences 53:371-376 (1993); Sauro and Thomas, J.
Pharm. and Experimental Therapeutics 267(3):119-1125 (1993); Wolbring et al., J. Biol.
Chem. 269(36):22470-22472 (1994); and Yoneda et al., Cancer Research 51:4430-4435
(1991); all of which are incorporated herein by reference in their entirety, including any
drawings.

Other compounds that could be used as modulators include oxindolinones such as
those described in U.S. patent application Serial No. 08/702,232 filed August 23, 1996,
incorporated herein by reference in its entirety, including any drawings.

Methods of Treating a Disease (Enablement - i.e., Dosing)

Methods of determining the dosages of compounds to be administered to a patient
and modes of administering compounds to an organism are disclosed in U.S. Application
Serial No. 08/702,282, filed August 23, 1996 and International patent publication number
WO 96/22976, published August 1, 1996, both of which are incorporated herein by
reference in their entirety, including any drawings, figures or tables. Those skilled in the
art will appreciate that such descriptions are applicable to the present invention and can be
easily adapted to it.

The proper dosage depends on various factors such as the type of disease being
treated, the particular composition being used and the size and physiological condition of
the patient. Therapeutically effective doses for the compounds described herein can be
estimated initially from cell culture and animal models. For example, a dose can be
formulated in animal models to achieve a circulating concentration range that initially
takes into account the IC50 as determined in cell culture assays. The animal model data
can be used to more accurately determine useful doses in humans.

Plasma half-life and biodistribution of the drug and metabolites in the plasma,
tumors and major organs can also be determined to facilitate the selection of drugs most
appropriate to inhibit a disorder. Such measurements can be carried out. For example,
HPLC analysis can be performed on the plasma of animals treated with the drug and the
location of radiolabeled compounds can be determined using detection methods such as
X-ray, CAT scan and MRI. Compounds that show potent inhibitory activity in the
screening assays, but have poor pharmacokinetic characteristics, can be optimized by
altering the chemical structure and retesting. In this regard, compounds displaying good
pharmacokinetic characteristics can be used as a model.

Toxicity studies can also be carried out by measuring the blood cell composition.
For example, toxicity studies can be carried out in a suitable animal model as follows:
1) the compound is administered to mice (an untreated control mouse should also be
used); 2) blood samples are periodically obtained via the tail vein from one mouse in each
treatment group; and 3) the samples are analyzed for red and white blood cell counts,
blood cell composition and the percent of lymphocytes versus polymorphonuclear cells. A
comparison of results for each dosing regime with the controls indicates if toxicity is
present.

At the termination of each toxicity study, further studies can be carried out by
sacrificing the animals (preferably, in accordance with the American Veterinary Medical
Association guidelines Report of the American Veterinary Medical Assoc. Panel on
Euthanasia, Journal of American Veterinary Medical Assoc., 202:229-249, 1993).
Representative animals from each treatment group can then be examined by gross
necropsy for immediate evidence of metastasis, unusual illness or toxicity. Gross
abnormalities in tissue are noted and tissues are examined histologically. Compounds
causing a reduction in body weight or blood components are less preferred, as are
compounds having an adverse effect on major organs. In general, the greater the adverse
effect the less preferred the compound.

For the treatment of cancers the expected daily dose of a hydrophobic
pharmaceutical agent is between 1 and 500 mg/day, preferably 1 to 250 mg/day, and most
preferably 1 to 50 mg/day. Drugs can be delivered less frequently provided plasma levels
of the active moiety are sufficient to maintain therapeutic effectiveness.

Plasma levels should reflect the potency of the drug. Generally, the more potent
the compound the lower the plasma levels necessary to achieve efficacy.

In a final aspect, the invention features a method for detection of a kinase
polypeptide in a sample as a diagnostic tool for a disease or disorder, wherein the method
comprises: (a) comparing a nucleic acid target region encoding the kinase polypeptide in
a sample, where the kinase polypeptide is selected from the group consisting of
SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ ID NO:127,
SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131, SEQ ID NO:132, SEQ ID NO:133,
SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136, SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139,
SEQ ID NO:140, SEQ ID NO:141, SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145,
SEQ ID NO:146, SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151,
SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156, SEQ ID NO:157,
SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161, SEQ ID NO:162, SEQ ID NO:163,
SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166, SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169,
SEQ ID NO:170, SEQ ID NO:171, SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175,
SEQ ID NO:176, SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181,
SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186, SEQ ID NO:187,
SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191, SEQ ID NO:199, SEQ ID NO:193,
SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196, SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199,
SEQ ID NO:200, SEQ ID NO:201, SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205,
SEQ ID NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211,
SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID NO:217,
SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID NO:223,
SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229,
SEQ ID NO:230, SEQ ID NO:231, SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235,
SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241,
and SEQ ID NO:242, or one or more fragments thereof, with a control nucleic acid target region
encoding the kinase polypeptide, or one or more fragments thereof; and (b) detecting
differences in sequence or amount between the target region and the control target region,
as an indication of the disease or disorder. Preferably, the disease or disorder is selected
from the group consisting of immune-related diseases and disorders, organ transplantation,
myocardial infarction, cardiovascular disease, stroke, renal failure, oxidative stress-related
neurodegenerative disorders, and cancer. Immune-related diseases and disorders include,
but are not limited to, those discussed previously.

The term "comparing" as used herein refers to identifying discrepancies between
the nucleic acid target region isolated from a sample, and the control nucleic acid target
region. The discrepancies can be in the nucleotide sequences, e.g. insertions, deletions, or
point mutations, or in the amount of a given nucleotide sequence. Methods to determine
these discrepancies in sequences are well-known to one of ordinary skill in the art. The
"control" nucleic acid target region refers to the sequence or amount of the sequence
found in normal cells, e.g. cells that are not diseased as discussed previously.
The term also includes anti-sense molecules drawn thereto.
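
The kinds of sequence discrepancy named above (insertions, deletions, point mutations) can be illustrated with a minimal sketch. It assumes, purely for illustration, that the sample and control target regions have already been aligned by some external method, with `-` marking gap positions; the function name and the tuple layout of the result are hypothetical.

```python
from typing import List, Tuple


def find_discrepancies(sample: str, control: str) -> List[Tuple[int, str, str, str]]:
    """Compare two pre-aligned sequences column by column.

    Returns one tuple per differing column:
    (column index, kind, control base, sample base).
    A gap ('-') in the control is reported as an insertion in the sample;
    a gap in the sample is reported as a deletion.
    """
    if len(sample) != len(control):
        raise ValueError("aligned sequences must have equal length")
    found = []
    for i, (s, c) in enumerate(zip(sample.upper(), control.upper())):
        if s == c:
            continue  # identical column (including gap/gap), nothing to report
        if c == "-":
            kind = "insertion"
        elif s == "-":
            kind = "deletion"
        else:
            kind = "point mutation"
        found.append((i, kind, c, s))
    return found
```

An empty result would indicate no sequence discrepancy between the sample and the control; differences in the amount of a sequence would have to be assessed separately.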

The invention has been described broadly and generically herein. Each of the
narrower species and subgeneric groupings falling within the generic disclosure also form
part of the invention. This includes the generic description of the invention with a proviso
or negative limitation removing any subject matter from the genus, regardless of whether
or not the excised material is specifically recited herein. For example, in some instances
the nucleotide sequence of particular kinase polypeptides may not be part of a preferred
embodiment.

The summary of the invention described above is not limiting and other features
and advantages of the invention will be apparent from the following detailed description of
the invention, and from the claims.

BRIEF DESCRIPTION OF THE FIGURES
Figures 1A to 1BB show the amino acid sequences of
SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ ID NO:127,
SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131, SEQ ID NO:132, SEQ ID NO:133,
SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136, SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139,
SEQ ID NO:140, SEQ ID NO:141, SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145,
SEQ ID NO:146, SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151,
SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156, SEQ ID NO:157,
SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161, SEQ ID NO:162, SEQ ID NO:163,
SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166, SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169,
SEQ ID NO:170, SEQ ID NO:171, SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175,
SEQ ID NO:176, SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181,
SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186, SEQ ID NO:187,
SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191, SEQ ID NO:199, SEQ ID NO:193,
SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196, SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199,
SEQ ID NO:200, SEQ ID NO:201, SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205,
SEQ ID NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211,
SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID NO:217,
SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID NO:223,
SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229,
SEQ ID NO:230, SEQ ID NO:231, SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235,
SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241,
and SEQ ID NO:242.

Figures 2A to 2MMMM show the nucleic acid sequences of
SEQ ID NO:1, SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6,
SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:11, SEQ ID NO:12,
SEQ ID NO:13, SEQ ID NO:14, SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:17, SEQ ID NO:18,
SEQ ID NO:19, SEQ ID NO:20, SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:23, SEQ ID NO:24,
SEQ ID NO:25, SEQ ID NO:26, SEQ ID NO:27, SEQ ID NO:28, SEQ ID NO:29, SEQ ID NO:30,
SEQ ID NO:31, SEQ ID NO:32, SEQ ID NO:33, SEQ ID NO:34, SEQ ID NO:35, SEQ ID NO:36,
SEQ ID NO:37, SEQ ID NO:38, SEQ ID NO:39, SEQ ID NO:40, SEQ ID NO:41, SEQ ID NO:42,
SEQ ID NO:43, SEQ ID NO:44, SEQ ID NO:45, SEQ ID NO:46, SEQ ID NO:47, SEQ ID NO:48,
SEQ ID NO:49, SEQ ID NO:50, SEQ ID NO:51, SEQ ID NO:52, SEQ ID NO:53, SEQ ID NO:54,
SEQ ID NO:55, SEQ ID NO:56, SEQ ID NO:57, SEQ ID NO:58, SEQ ID NO:59, SEQ ID NO:60,
SEQ ID NO:61, SEQ ID NO:62, SEQ ID NO:63, SEQ ID NO:64, SEQ ID NO:65, SEQ ID NO:66,
SEQ ID NO:67, SEQ ID NO:68, SEQ ID NO:69, SEQ ID NO:70, SEQ ID NO:71, SEQ ID NO:72,
SEQ ID NO:73, SEQ ID NO:74, SEQ ID NO:75, SEQ ID NO:76, SEQ ID NO:77, SEQ ID NO:78,
SEQ ID NO:79, SEQ ID NO:80, SEQ ID NO:81, SEQ ID NO:82, SEQ ID NO:83, SEQ ID NO:84,
SEQ ID NO:85, SEQ ID NO:86, SEQ ID NO:87, SEQ ID NO:88, SEQ ID NO:89, SEQ ID NO:90,
SEQ ID NO:91, SEQ ID NO:92, SEQ ID NO:93, SEQ ID NO:94, SEQ ID NO:95, SEQ ID NO:96,
SEQ ID NO:97, SEQ ID NO:98, SEQ ID NO:99, SEQ ID NO:100, SEQ ID NO:101, SEQ ID NO:102,
SEQ ID NO:103, SEQ ID NO:104, SEQ ID NO:105, SEQ ID NO:106, SEQ ID NO:107, SEQ ID NO:108,
SEQ ID NO:109, SEQ ID NO:110, SEQ ID NO:111, SEQ ID NO:112, SEQ ID NO:113, SEQ ID NO:114,
SEQ ID NO:115, SEQ ID NO:116, SEQ ID NO:117, SEQ ID NO:118, SEQ ID NO:119, SEQ ID NO:120,
and SEQ ID NO:121.

DETAILED DESCRIPTION OF THE INVENTION 
The present invention relates in part to kinase polypeptides, nucleic acids encoding 
such polypeptides, cells containing such nucleic acids, antibodies to such polypeptides, 
assays utilizing such polypeptides, and methods relating to all of the foregoing. The 
present invention is based upon the isolation and characterization of new kinase 
polypeptides. The polypeptides and nucleic acids may be produced using well-known and 
standard synthesis techniques when given the sequences presented herein. 

I. The Nucleic Acids of the Invention

Included within the scope of this invention aie the functional equivalents of the 
herein-described isolated nucleic acid molecules. The degeneracy of the genetic code 
permits substitution of certain codons by other codons that specify the same amino acid 
and hence would give rise to the same protein. The nucleic acid sequence can vary 
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substantially since, with the exception of methionine and tryptophan, the known amino 
acids can be coded for by more than one codon. Thus, portions or all of the kinase genes
of the invention could be synthesized to give a nucleic acid sequence significantly
different from one selected from the group consisting of those set forth in SEQ ID NO:1,
SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6,
SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:11,
SEQ ID NO:12, SEQ ID NO:13, SEQ ID NO:14, SEQ ID NO:15, SEQ ID NO:16,
SEQ ID NO:17, SEQ ID NO:18, SEQ ID NO:19, SEQ ID NO:20, SEQ ID NO:21,
SEQ ID NO:22, SEQ ID NO:23, SEQ ID NO:24, SEQ ID NO:25, SEQ ID NO:26,
SEQ ID NO:27, SEQ ID NO:28, SEQ ID NO:29, SEQ ID NO:30, SEQ ID NO:31,
SEQ ID NO:32, SEQ ID NO:33, SEQ ID NO:34, SEQ ID NO:35, SEQ ID NO:36,
SEQ ID NO:37, SEQ ID NO:38, SEQ ID NO:39, SEQ ID NO:40, SEQ ID NO:41,
SEQ ID NO:42, SEQ ID NO:43, SEQ ID NO:44, SEQ ID NO:45, SEQ ID NO:46,
SEQ ID NO:47, SEQ ID NO:48, SEQ ID NO:49, SEQ ID NO:50, SEQ ID NO:51,
SEQ ID NO:52, SEQ ID NO:53, SEQ ID NO:54, SEQ ID NO:55, SEQ ID NO:56,
SEQ ID NO:57, SEQ ID NO:58, SEQ ID NO:59, SEQ ID NO:60, SEQ ID NO:61,
SEQ ID NO:62, SEQ ID NO:63, SEQ ID NO:64, SEQ ID NO:65, SEQ ID NO:66,
SEQ ID NO:67, SEQ ID NO:68, SEQ ID NO:69, SEQ ID NO:70, SEQ ID NO:71,
SEQ ID NO:72, SEQ ID NO:73, SEQ ID NO:74, SEQ ID NO:75, SEQ ID NO:76,
SEQ ID NO:77, SEQ ID NO:78, SEQ ID NO:79, SEQ ID NO:80, SEQ ID NO:81,
SEQ ID NO:82, SEQ ID NO:83, SEQ ID NO:84, SEQ ID NO:85, SEQ ID NO:86,
SEQ ID NO:87, SEQ ID NO:88, SEQ ID NO:89, SEQ ID NO:90, SEQ ID NO:91,
SEQ ID NO:92, SEQ ID NO:93, SEQ ID NO:94, SEQ ID NO:95, SEQ ID NO:96,
SEQ ID NO:97, SEQ ID NO:98, SEQ ID NO:99, SEQ ID NO:100, SEQ ID NO:101,
SEQ ID NO:102, SEQ ID NO:103, SEQ ID NO:104, SEQ ID NO:105, SEQ ID NO:106,
SEQ ID NO:107, SEQ ID NO:108, SEQ ID NO:109, SEQ ID NO:110, SEQ ID NO:111,
SEQ ID NO:112, SEQ ID NO:113, SEQ ID NO:114, SEQ ID NO:115, SEQ ID NO:116,
SEQ ID NO:117, SEQ ID NO:118, SEQ ID NO:119, SEQ ID NO:120, and SEQ ID NO:121.
The encoded amino acid sequence thereof would, however, be preserved.
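
The degeneracy argument above can be illustrated with a minimal sketch, assuming the standard genetic code. The example sequence and the helper names (`translate`, `synonyms`, `recode`) are hypothetical and are not taken from the sequences of the invention; the sketch only shows that a nucleic acid sequence can be changed codon by codon while the encoded amino acid sequence is preserved.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ... ("*" marks a stop).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}


def translate(dna):
    """Translate a DNA string codon by codon (length assumed to be a multiple of 3)."""
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, len(dna), 3))


def synonyms(codon):
    """Return, in sorted order, all codons encoding the same residue as `codon`."""
    residue = CODON_TABLE[codon]
    return sorted(c for c, aa in CODON_TABLE.items() if aa == residue)


def recode(dna):
    """Swap every codon for the next synonymous codon, cycling through the alternatives."""
    out = []
    for i in range(0, len(dna), 3):
        codon = dna[i:i + 3]
        alternatives = synonyms(codon)
        out.append(alternatives[(alternatives.index(codon) + 1) % len(alternatives)])
    return "".join(out)


# Hypothetical 5-codon example encoding Met-Ala-Trp-Lys-Leu.
original = "ATGGCTTGGAAACTG"
variant = recode(original)
print(variant, translate(variant))
```

Methionine (ATG) and tryptophan (TGG) each have a single codon, so those positions are unchanged by `recode`, while every other position is free to vary.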

In addition, the nucleic acid sequence may comprise a nucleotide sequence which
results from the addition, deletion or substitution of at least one nucleotide to the 5'-end
and/or the 3'-end of the nucleic acid sequence shown in SEQ ID NO:1, SEQ ID NO:2,
SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:5, SEQ ID NO:6, SEQ ID NO:7,
SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO:10, SEQ ID NO:11, SEQ ID NO:12,
SEQ ID NO:13, SEQ ID NO:14, SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:17,
SEQ ID NO:18, SEQ ID NO:19, SEQ ID NO:20, SEQ ID NO:21, SEQ ID NO:22,
SEQ ID NO:23, SEQ ID NO:24, SEQ ID NO:25, SEQ ID NO:26, SEQ ID NO:27,
SEQ ID NO:28, SEQ ID NO:29, SEQ ID NO:30, SEQ ID NO:31, SEQ ID NO:32,
SEQ ID NO:33, SEQ ID NO:34, SEQ ID NO:35, SEQ ID NO:36, SEQ ID NO:37,
SEQ ID NO:38, SEQ ID NO:39, SEQ ID NO:40, SEQ ID NO:41, SEQ ID NO:42,
SEQ ID NO:43, SEQ ID NO:44, SEQ ID NO:45, SEQ ID NO:46, SEQ ID NO:47,
SEQ ID NO:48, SEQ ID NO:49, SEQ ID NO:50, SEQ ID NO:51, SEQ ID NO:52,
SEQ ID NO:53, SEQ ID NO:54, SEQ ID NO:55, SEQ ID NO:56, SEQ ID NO:57,
SEQ ID NO:58, SEQ ID NO:59, SEQ ID NO:60, SEQ ID NO:61, SEQ ID NO:62,
SEQ ID NO:63, SEQ ID NO:64, SEQ ID NO:65, SEQ ID NO:66, SEQ ID NO:67,
SEQ ID NO:68, SEQ ID NO:69, SEQ ID NO:70, SEQ ID NO:71, SEQ ID NO:72,
SEQ ID NO:73, SEQ ID NO:74, SEQ ID NO:75, SEQ ID NO:76, SEQ ID NO:77,
SEQ ID NO:78, SEQ ID NO:79, SEQ ID NO:80, SEQ ID NO:81, SEQ ID NO:82,
SEQ ID NO:83, SEQ ID NO:84, SEQ ID NO:85, SEQ ID NO:86, SEQ ID NO:87,
SEQ ID NO:88, SEQ ID NO:89, SEQ ID NO:90, SEQ ID NO:91, SEQ ID NO:92,
SEQ ID NO:93, SEQ ID NO:94, SEQ ID NO:95, SEQ ID NO:96, SEQ ID NO:97,
SEQ ID NO:98, SEQ ID NO:99, SEQ ID NO:100, SEQ ID NO:101, SEQ ID NO:102,
SEQ ID NO:103, SEQ ID NO:104, SEQ ID NO:105, SEQ ID NO:106, SEQ ID NO:107,
SEQ ID NO:108, SEQ ID NO:109, SEQ ID NO:110, SEQ ID NO:111, SEQ ID NO:112,
SEQ ID NO:113, SEQ ID NO:114, SEQ ID NO:115, SEQ ID NO:116, SEQ ID NO:117,
SEQ ID NO:118, SEQ ID NO:119, SEQ ID NO:120, and SEQ ID NO:121, or a derivative thereof. Any nucleotide
or polynucleotide may be used in this regard, provided that its addition, deletion or
substitution does not alter the amino acid sequence of SEQ ID NO:122, SEQ ID NO:123,
SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ ID NO:127, SEQ ID NO:128,
SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131, SEQ ID NO:132, SEQ ID NO:133,
SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136, SEQ ID NO:137, SEQ ID NO:138,
SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141, SEQ ID NO:142, SEQ ID NO:143,
SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146, SEQ ID NO:147, SEQ ID NO:148,
SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151, SEQ ID NO:152, SEQ ID NO:153,
SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156, SEQ ID NO:157, SEQ ID NO:158,
SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161, SEQ ID NO:162, SEQ ID NO:163,
SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166, SEQ ID NO:167, SEQ ID NO:168,
SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171, SEQ ID NO:172, SEQ ID NO:173,
SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176, SEQ ID NO:177, SEQ ID NO:178,
SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181, SEQ ID NO:182, SEQ ID NO:183,
SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186, SEQ ID NO:187, SEQ ID NO:188,
SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191, SEQ ID NO:192, SEQ ID NO:193,
SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196, SEQ ID NO:197, SEQ ID NO:198,
SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201, SEQ ID NO:202, SEQ ID NO:203,
SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID NO:207, SEQ ID NO:208,
SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211, SEQ ID NO:212, SEQ ID NO:213,
SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID NO:217, SEQ ID NO:218,
SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID NO:223,
SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ID NO:228,
SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231, SEQ ID NO:232, SEQ ID NO:233,
SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238,
SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241, and SEQ ID NO:242, that is encoded
by the nucleotide sequence. For example, the present invention is intended to include any
nucleic acid sequence resulting from the addition of ATG as an initiation codon at the
5'-end of the inventive nucleic acid sequence or its derivative, or from the addition of TAA,
TAG or TGA as a termination codon at the 3'-end of the inventive nucleotide sequence or
its derivative. Moreover, the nucleic acid molecule of the present invention may, as
necessary, have restriction endonuclease recognition sites added to its 5'-end and/or
3'-end.
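
As a hedged illustration of the end modifications described above, the following sketch adds an initiation codon, a termination codon and optional flanking restriction sites to a coding fragment. The function names and the example fragment are hypothetical; the stop codons used are the three of the standard genetic code (TAA, TAG, TGA), and EcoRI (GAATTC) and BamHI (GGATCC) are chosen only as familiar examples of recognition sites.

```python
STOP_CODONS = ("TAA", "TAG", "TGA")  # termination codons of the standard genetic code


def add_start_and_stop(coding_dna, stop="TAA"):
    """Return the fragment with ATG at the 5'-end and a stop codon at the 3'-end.

    Codons already present at the ends are not duplicated.
    """
    if stop not in STOP_CODONS:
        raise ValueError(f"{stop!r} is not a termination codon")
    seq = coding_dna.upper()
    if len(seq) % 3 != 0:
        raise ValueError("coding fragment must contain whole codons")
    if not seq.startswith("ATG"):
        seq = "ATG" + seq
    if seq[-3:] not in STOP_CODONS:
        seq += stop
    return seq


def add_restriction_sites(dna, five_prime="GAATTC", three_prime="GGATCC"):
    """Flank a sequence with recognition sites (defaults: EcoRI at 5', BamHI at 3')."""
    return five_prime + dna + three_prime


# Hypothetical fragment encoding Ala-Trp.
orf = add_start_and_stop("GCTTGG")
print(orf)                         # ATGGCTTGGTAA
print(add_restriction_sites(orf))  # GAATTCATGGCTTGGTAAGGATCC
```

None of these additions changes the codons of the original fragment, which is the condition the text places on such modifications.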

Such functional alterations of a given nucleic acid sequence afford an opportunity
to promote secretion and/or processing of heterologous proteins encoded by foreign
nucleic acid sequences fused thereto, for example. All variations of the nucleotide
sequence of the kinase genes of the invention and fragments thereof permitted by the
genetic code are, therefore, included in this invention.

Further, it is possible to delete codons or to substitute one or more codons with
codons other than degenerate codons to produce a structurally modified polypeptide, but
one which has substantially the same utility or activity as the polypeptide produced by the
unmodified nucleic acid molecule. As recognized in the art, the two polypeptides are
functionally equivalent, as are the two nucleic acid molecules that give rise to their
production, even though the differences between the nucleic acid molecules are not related
to the degeneracy of the genetic code. This is discussed further in the "Functional
Derivatives" section, herein.

Finally, many of the nucleic acid molecules of the invention are provided as a 
partial sequence only (Fig. 2A through 2QQ). However, it is standard for one of ordinary 
skill in the art to obtain a full-length sequence when provided with a partial sequence.
Similarly, when provided with a partial or full-length sequence it is standard for one of 
ordinary skill in the art to obtain nucleic acid sequence coding for homologous proteins. 
Therefore, these nucleic acid molecules are also part of the invention. 

The characteristics of the protein kinase nucleic acid sequences of the invention are 
provided in Table 1. The protein kinases fall into 10 known groups: AGC, CAMK, CKI, 
CMGC, dsPK, EIFK, LIMK, MLK, STE and TK. In addition, there are a significant 
number of protein kinases that do not belong to any of the known groups, and therefore 
presumably define new protein kinase groups. 

Additional characteristics may be found, inter alia, in the tables, namely Table 1,
Table 2, Table 3 and Table 4, shown below. 

II. Nucleic Acid Probes, Methods, and Kits for Detection of Protein Kinases.

A nucleic acid probe of the present invention may be used to probe an appropriate 
chromosomal or cDNA library by usual hybridization methods to obtain other nucleic acid 
molecules of the present invention. A chromosomal DNA or cDNA library may be
prepared from appropriate cells according to recognized methods in the art (cf. "Molecular 
Cloning: A Laboratory Manual", second edition, Cold Spring Harbor Laboratory, 
Sambrook, Fritsch, & Maniatis, eds., 1989). 

In the alternative, chemical synthesis can be carried out in order to obtain nucleic
acid probes having nucleotide sequences that correspond to N-terminal, kinase or
C-terminal portions, for example, of the amino acid sequence of the polypeptide of interest.
The synthesized nucleic acid probes may be used as primers in a polymerase chain
reaction (PCR) carried out in accordance with recognized PCR techniques, essentially
according to PCR Protocols, "A Guide to Methods and Applications", Academic Press,
Michael, et al., eds., 1990, utilizing the appropriate chromosomal or cDNA library to
obtain the fragment of the present invention.
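
When a probe or primer is designed from an amino acid sequence, as described above, one practical consideration is how many distinct nucleotide sequences could encode the chosen stretch. The sketch below, which assumes the standard genetic code, counts that degeneracy and picks the least degenerate window of a peptide. The peptide used is an illustrative motif only and is not asserted to occur in any sequence of the invention; the function names are hypothetical.

```python
from math import prod

# Number of codons per amino acid in the standard genetic code (61 sense codons in total).
CODONS_PER_RESIDUE = {
    "A": 4, "R": 6, "N": 2, "D": 2, "C": 2, "Q": 2, "E": 2, "G": 4, "H": 2, "I": 3,
    "L": 6, "K": 2, "M": 1, "F": 2, "P": 4, "S": 6, "T": 4, "W": 1, "Y": 2, "V": 4,
}


def degeneracy(peptide):
    """Number of distinct nucleotide sequences that encode the peptide."""
    return prod(CODONS_PER_RESIDUE[aa] for aa in peptide)


def least_degenerate_window(peptide, length):
    """Return (start, window, degeneracy) for the window with the fewest encodings."""
    windows = [
        (start, peptide[start:start + length])
        for start in range(len(peptide) - length + 1)
    ]
    start, window = min(windows, key=lambda item: degeneracy(item[1]))
    return start, window, degeneracy(window)


# Illustrative 8-residue peptide; a 6-residue window corresponds to an 18-nucleotide probe.
print(degeneracy("HRDLKPEN"))
print(least_degenerate_window("HRDLKPEN", 6))
```

Windows rich in methionine and tryptophan give the least degenerate probes, since each of those residues is specified by a single codon.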

One skilled in the art can readily design such probes based on the sequence
disclosed herein using methods of computer alignment and sequence analysis known in
the art ("Molecular Cloning: A Laboratory Manual", 1989, supra). The hybridization
probes of the present invention can be labeled by standard labeling techniques such as
with a radiolabel, enzyme label, fluorescent label, biotin-avidin label, chemiluminescence,
and the like. After hybridization, the probes may be visualized using known methods.

The nucleic acid probes of the present invention include RNA, as well as DNA
probes, such probes being generated using techniques known in the art. The nucleic acid
probe may be immobilized on a solid support. Examples of such solid supports include,
but are not limited to, plastics such as polycarbonate, complex carbohydrates such as
agarose and sepharose, and acrylic resins, such as polyacrylamide and latex beads.
Techniques for coupling nucleic acid probes to such solid supports are well known in the
art.
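
A minimal sketch of one step in computer-assisted probe design follows: deriving the antisense (reverse-complement) probe for a target stretch and estimating its melting temperature. The Wallace rule used here (2 °C per A/T and 4 °C per G/C) is a rough approximation that is generally applied only to short oligonucleotides, and it is an assumption of this example rather than a method named in the text. The target sequence and function names are hypothetical.

```python
# Translation table mapping each base to its Watson-Crick complement.
COMPLEMENT = str.maketrans("ACGTacgt", "TGCAtgca")


def reverse_complement(dna):
    """Return the reverse complement, i.e. the strand that hybridizes to `dna`."""
    return dna.translate(COMPLEMENT)[::-1]


def wallace_tm(oligo):
    """Approximate melting temperature in degrees C using the Wallace rule."""
    seq = oligo.upper()
    at = seq.count("A") + seq.count("T")
    gc = seq.count("G") + seq.count("C")
    return 2 * at + 4 * gc


target = "ATGGCTTGGAAACTG"  # hypothetical 15-nucleotide target stretch
probe = reverse_complement(target)
print(probe, wallace_tm(probe))
```

In practice the hybridization and wash conditions would be chosen with reference to an estimate of this kind, refined for probe length and salt concentration.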

The test samples suitable for nucleic acid probing methods of the present invention 
include, for example, cells or nucleic acid extracts of cells, or biological fluids. The 
samples used in the above-described methods will vary based on the assay format, the 
detection method and the nature of the tissues, cells or extracts to be assayed. Methods for
preparing nucleic acid extracts of cells are well known in the art and can be readily
adapted in order to obtain a sample that is compatible with the method utilized. 

One method of detecting the presence of nucleic acids of the invention in a sample
comprises (a) contacting said sample with the above-described nucleic acid probe under
conditions such that hybridization occurs, and (b) detecting the presence of said probe
bound to said nucleic acid molecule. One skilled in the art would select the nucleic acid
probe according to techniques known in the art as described above. Samples to be tested
include but should not be limited to RNA samples of human tissue.

A kit for detecting the presence of nucleic acids of the invention in a sample
comprises at least one container means having disposed therein the above-described
nucleic acid probe. The kit may further comprise other containers comprising one or more
of the following: wash reagents and reagents capable of detecting the presence of bound
nucleic acid probe. Examples of detection reagents include, but are not limited to,
radiolabelled probes, enzymatic labeled probes (horseradish peroxidase, alkaline
phosphatase), and affinity labeled probes (biotin, avidin, or streptavidin).

In detail, a compartmentalized kit includes any kit in which reagents are contained
in separate containers. Such containers include small glass containers, plastic containers
or strips of plastic or paper. Such containers allow the efficient transfer of reagents from
one compartment to another compartment such that the samples and reagents are not
cross-contaminated and the agents or solutions of each container can be added in a
quantitative fashion from one compartment to another. Such containers will include a
container which will accept the test sample, a container which contains the probe or
primers used in the assay, containers which contain wash reagents (such as phosphate
buffered saline, Tris-buffers, and the like), and containers which contain the reagents used
to detect the hybridized probe, bound antibody, amplified product, or the like. One skilled
in the art will readily recognize that the nucleic acid probes described in the present
invention can readily be incorporated into one of the established kit formats that are well
known in the art.

III. DNA Constructs Comprising a Protein Kinase Nucleic Acid Molecule and Cells
Containing These Constructs. 

The present invention also relates to a recombinant DNA molecule comprising, 5'
to 3', a promoter effective to initiate transcription in a host cell and the above-described
nucleic acid molecules. In addition, the present invention relates to a recombinant DNA
molecule comprising a vector and an above-described nucleic acid molecule. The present
invention also relates to a nucleic acid molecule comprising a transcriptional region
functional in a cell, a sequence complementary to an RNA sequence encoding an amino
acid sequence corresponding to the above-described polypeptide, and a transcriptional
termination region functional in said cell. The above-described molecules may be isolated
and/or purified DNA molecules.

The present invention also relates to a cell or organism that contains an
above-described nucleic acid molecule and thereby is capable of expressing a polypeptide. The
polypeptide may be purified from cells that have been altered to express the polypeptide.
A cell is said to be "altered to express a desired polypeptide" when the cell, through
genetic manipulation, is made to produce a protein which it normally does not produce or
which the cell normally produces at lower levels. One skilled in the art can readily adapt
procedures for introducing and expressing either genomic, cDNA, or synthetic sequences
into either eukaryotic or prokaryotic cells.

A nucleic acid molecule, such as DNA, is said to be "capable of expressing" a
polypeptide if it contains nucleotide sequences which contain transcriptional and
translational regulatory information and such sequences are "operably linked" to
nucleotide sequences which encode the polypeptide. An operable linkage is a linkage in
which the regulatory DNA sequences and the DNA sequence sought to be expressed are
connected in such a way as to permit gene sequence expression. The precise nature of the
regulatory regions needed for gene sequence expression may vary from organism to
organism, but shall in general include a promoter region which, in prokaryotes, contains
both the promoter (which directs the initiation of RNA transcription) as well as the DNA
sequences which, when transcribed into RNA, will signal synthesis initiation. Such
regions will normally include those 5'-non-coding sequences involved with initiation of
transcription and translation, such as the TATA box, capping sequence, CAAT sequence,
and the like.

If desired, the non-coding region 3' to the sequence encoding a kinase of the
invention may be obtained by the above-described methods. This region may be retained
for its transcriptional termination regulatory sequences, such as termination and
polyadenylation. Thus, by retaining the 3'-region naturally contiguous to the DNA
sequence encoding a kinase of the invention, the transcriptional termination signals may
be provided. Where the transcriptional termination signals are not satisfactorily functional
in the expression host cell, then a 3' region functional in the host cell may be substituted.

Two DNA sequences (such as a promoter region sequence and a sequence
encoding a kinase of the invention) are said to be operably linked if the nature of the
linkage between the two DNA sequences does not (1) result in the introduction of a
frame-shift mutation, (2) interfere with the ability of the promoter region sequence to direct the
transcription of a gene sequence encoding a kinase of the invention, or (3) interfere with
the ability of the gene sequence of a kinase of the invention to be transcribed by the
promoter region sequence. Thus, a promoter region would be operably linked to a DNA
sequence if the promoter were capable of effecting transcription of that DNA sequence.
Thus, to express a gene encoding a kinase of the invention, transcriptional and
translational signals recognized by an appropriate host are necessary.

The present invention encompasses the expression of a gene encoding a kinase of
the invention (or a functional derivative thereof) in either prokaryotic or eukaryotic cells.
Prokaryotic hosts are, generally, very efficient and convenient for the production of
recombinant proteins and are, therefore, one type of preferred expression system for
kinases of the invention. Prokaryotes most frequently are represented by various strains of
E. coli. However, other microbial strains may also be used, including other bacterial
strains.

In prokaryotic systems, plasmid vectors that contain replication sites and control
sequences derived from a species compatible with the host may be used. Examples of
suitable plasmid vectors may include pBR322, pUC118, pUC119 and the like; suitable
phage or bacteriophage vectors may include λgt10, λgt11 and the like; and suitable virus
vectors may include pMAM-neo, pKRC and the like. Preferably, the selected vector of the
present invention has the capacity to replicate in the selected host cell.

Recognized prokaryotic hosts include bacteria such as E. coli, Bacillus,
Streptomyces, Pseudomonas, Salmonella, Serratia, and the like. However, under such
conditions, the polypeptide will not be glycosylated. The prokaryotic host must be
compatible with the replicon and control sequences in the expression plasmid.

To express a kinase of the invention (or a functional derivative thereof) in a
prokaryotic cell, it is necessary to operably link the sequence encoding the kinase of the
invention to a functional prokaryotic promoter. Such promoters may be either constitutive
or, more preferably, regulatable (i.e., inducible or derepressible). Examples of constitutive
promoters include the int promoter of bacteriophage λ, the bla promoter of the
β-lactamase gene sequence of pBR322, and the cat promoter of the chloramphenicol acetyl
transferase gene sequence of pBR325, and the like. Examples of inducible prokaryotic
promoters include the major right and left promoters of bacteriophage λ (PL and PR), the
trp, recA, lacZ, lacI, and gal promoters of E. coli, the α-amylase (Ulmanen et al., J.
Bacteriol. 162:176-182, 1985) and the σ-28-specific promoters of B. subtilis (Gilman et
al., Gene Sequence 32:11-20, 1984), the promoters of the bacteriophages of Bacillus
(Gryczan, In: The Molecular Biology of the Bacilli, Academic Press, Inc., NY, 1982), and
Streptomyces promoters (Ward et al., Mol. Gen. Genet. 203:468-478, 1986). Prokaryotic
promoters are reviewed by Glick (Ind. Microbiol. 1:277-282, 1987), Cenatiempo
(Biochimie 68:505-516, 1986), and Gottesman (Ann. Rev. Genet. 18:415-442, 1984).

Proper expression in a prokaryotic cell also requires the presence of a
ribosome-binding site upstream of the gene sequence-encoding sequence. Such ribosome-binding
sites are disclosed, for example, by Gold et al. (Ann. Rev. Microbiol. 35:365-404, 1981).
The selection of control sequences, expression vectors, transformation methods, and the
like, are dependent on the type of host cell used to express the gene. As used herein,
"cell", "cell line", and "cell culture" may be used interchangeably and all such
designations include progeny. Thus, the words "transformants" or "transformed cells"
include the primary subject cell and cultures derived therefrom, without regard to the
number of transfers. It is also understood that all progeny may not be precisely identical
in DNA content, due to deliberate or inadvertent mutations. However, as defined, mutant
progeny have the same functionality as that of the originally transformed cell.

Host cells which may be used in the expression systems of the present invention
are not strictly limited, provided that they are suitable for use in the expression of the
kinase polypeptide of interest. Suitable hosts may often include eukaryotic cells.
Preferred eukaryotic hosts include, for example, yeast, fungi, insect cells, mammalian cells
either in vivo, or in tissue culture. Mammalian cells which may be useful as hosts include
HeLa cells, cells of fibroblast origin such as VERO or CHO-K1, or cells of lymphoid
origin and their derivatives. Preferred mammalian host cells include SP2/0 and J558L, as
well as neuroblastoma cell lines such as IMR 332, which may provide better capacities for
correct post-translational processing.

In addition, plant cells are also available as hosts, and control sequences
compatible with plant cells are available, such as the cauliflower mosaic virus 35S and
19S, and nopaline synthase promoter and polyadenylation signal sequences. Another
preferred host is an insect cell, for example the Drosophila larvae. Using insect cells as
hosts, the Drosophila alcohol dehydrogenase promoter can be used (Rubin, Science
240:1453-1459, 1988). Alternatively, baculovirus vectors can be engineered to express
large amounts of kinases of the invention in insect cells (Jasny, Science 238:1653, 1987;
Miller et al., In: Genetic Engineering, Vol. 8, Plenum, Setlow et al., eds., pp. 111-291,
1986).

Any of a series of yeast expression systems can be utilized which incorporate
promoter and termination elements from the actively expressed sequences coding for
glycolytic enzymes that are produced in large quantities when yeast are grown in mediums
rich in glucose. Known glycolytic gene sequences can also provide very efficient
transcriptional control signals. Yeast provides substantial advantages in that it can also
carry out post-translational modifications. A number of recombinant DNA strategies exist
utilizing strong promoter sequences and high copy number plasmids which can be utilized
for production of the desired proteins in yeast. Yeast recognizes leader sequences on
cloned mammalian genes and secretes peptides bearing leader sequences (i.e.,
pre-peptides). Several possible vector systems are available for the expression of kinases of
the invention in a mammalian host.

A wide variety of transcriptional and translational regulatory sequences may be 
employed, depending upon the nature of the host. The transcriptional and translational 
regulatory signals may be derived from viral sources, such as adenovirus, bovine 
papilloma virus, cytomegalovirus, simian virus, or the like, where the regulatory signals
are associated with a particular gene sequence which has a high level of expression.
Alternatively, promoters from mammalian expression products, such as actin, collagen,
myosin, and the like, may be employed. Transcriptional initiation regulatory signals may 
be selected which allow for repression or activation, so that expression of the gene 
sequences can be modulated. Of interest are regulatory signals which are temperature- 
sensitive so that by varying the temperature, expression can be repressed or initiated, or 
are subject to chemical (such as metabolite) regulation. 

Expression of kinases of the invention in eukaryotic hosts requires the use of
eukaryotic regulatory regions. Such regions will, in general, include a promoter region
sufficient to direct the initiation of RNA synthesis. Preferred eukaryotic promoters
include, for example, the promoter of the mouse metallothionein I gene sequence (Hamer
et al., J. Mol. Appl. Gen. 1:273-288, 1982); the TK promoter of Herpes virus (McKnight,
Cell 31:355-365, 1982); the SV40 early promoter (Benoist et al., Nature (London)
290:304-310, 1981); and the yeast gal4 gene sequence promoter (Johnston et al., Proc. Natl.
Acad. Sci. (USA) 79:6971-6975, 1982; Silver et al., Proc. Natl. Acad. Sci. (USA)
81:5951-5955, 1984).

Translation of eukaryotic mRNA is initiated at the codon that encodes the first
methionine. For this reason, it is preferable to ensure that the linkage between a
eukaryotic promoter and a DNA sequence which encodes a kinase of the invention (or a
functional derivative thereof) does not contain any intervening codons which are capable
of encoding a methionine (i.e., AUG). The presence of such codons results either in the
formation of a fusion protein (if the AUG codon is in the same reading frame as the kinase
of the invention coding sequence) or a frame-shift mutation (if the AUG codon is not in
the same reading frame as the kinase of the invention coding sequence).
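
The reading-frame reasoning above can be checked mechanically. The sketch below scans the DNA lying between the promoter and the authentic initiation codon (here called the linker, a hypothetical name) for intervening ATG triplets and reports whether each is in frame with the coding sequence that begins immediately after the linker. It is a simplified illustration under stated assumptions: it works on the DNA sense strand and does not look for stop codons between an upstream ATG and the authentic start.

```python
def upstream_atgs(linker):
    """List intervening ATG triplets in the linker and their frame.

    `linker` is the sense-strand DNA between the promoter and the authentic ATG;
    the coding sequence is assumed to start at the first base after the linker.
    Returns (position, "in-frame" | "out-of-frame") tuples, 0-based positions.
    """
    seq = linker.upper()
    found = []
    for pos in range(len(seq) - 2):
        if seq[pos:pos + 3] == "ATG":
            # Distance from this ATG to the authentic start codon.
            distance = len(seq) - pos
            frame = "in-frame" if distance % 3 == 0 else "out-of-frame"
            found.append((pos, frame))
    return found


# Hypothetical linkers.
print(upstream_atgs("GCCATGGCC"))  # ATG six bases upstream of the start: fusion protein
print(upstream_atgs("CCATGGC"))    # ATG five bases upstream: shifted reading frame
print(upstream_atgs("GCCGCC"))     # no intervening ATG: preferred linkage
```

A linkage for which the function returns an empty list corresponds to the preferred case described in the text.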

A nucleic acid molecule encoding a kinase of the invention and an operably linked
promoter may be introduced into a recipient prokaryotic or eukaryotic cell either as a
nonreplicating DNA or RNA molecule, which may either be a linear molecule or, more
preferably, a closed covalent circular molecule. Since such molecules are incapable of
autonomous replication, the expression of the gene may occur through the transient
expression of the introduced sequence. Alternatively, permanent expression may occur
through the integration of the introduced DNA sequence into the host chromosome.

A vector may be employed which is capable of integrating the desired gene
sequences into the host cell chromosome. Cells which have stably integrated the
introduced DNA into their chromosomes can be selected by also introducing one or more
markers which allow for selection of host cells which contain the expression vector. The
marker may provide for prototrophy to an auxotrophic host, biocide resistance, e.g.,
antibiotics, or heavy metals, such as copper, or the like. The selectable marker gene
sequence can either be directly linked to the DNA gene sequences to be expressed, or
introduced into the same cell by co-transfection. Additional elements may also be needed
for optimal synthesis of mRNA. These elements may include splice signals, as well as
transcription promoters, enhancers, and termination signals. cDNA expression vectors
incorporating such elements include those described by Okayama (Mol. Cell. Biol. 3:280,
1983).

The introduced nucleic acid molecule can be incorporated into a plasmid or viral
vector capable of autonomous replication in the recipient host. Any of a wide variety of
vectors may be employed for this purpose. Factors of importance in selecting a particular
plasmid or viral vector include: the ease with which recipient cells that contain the vector
may be recognized and selected from those recipient cells which do not contain the vector;
the number of copies of the vector which are desired in a particular host; and whether it is
desirable to be able to "shuttle" the vector between host cells of different species.

Preferred prokaryotic vectors include plasmids such as those capable of replication
in E. coli (such as, for example, pBR322, ColE1, pSC101, pACYC 184, πVX; "Molecular
Cloning: A Laboratory Manual", 1989, supra). Bacillus plasmids include pC194, pC221,
pT127, and the like (Gryczan, In: The Molecular Biology of the Bacilli, Academic Press,
NY, pp. 307-329, 1982). Suitable Streptomyces plasmids include pIJ101 (Kendall et al.,
J. Bacteriol. 169:4177-4183, 1987), and Streptomyces bacteriophages such as φC31
(Chater et al., In: Sixth International Symposium on Actinomycetales Biology, Akademiai
Kiado, Budapest, Hungary, pp. 45-54, 1986). Pseudomonas plasmids are reviewed by
John et al. (Rev. Infect. Dis. 8:693-704, 1986), and Izaki (Jpn. J. Bacteriol. 33:729-742,
1978).

Preferred eukaryotic plasmids include, for example, BPV, vaccinia, SV40,
2-micron circle, and the like, or their derivatives. Such plasmids are well known in the art
(Botstein et al., Miami Wntr. Symp. 19:265-274, 1982; Broach, In: "The Molecular
Biology of the Yeast Saccharomyces: Life Cycle and Inheritance", Cold Spring Harbor
Laboratory, Cold Spring Harbor, NY, pp. 445-470, 1981; Broach, Cell 28:203-204, 1982;
Bollon et al., J. Clin. Hematol. Oncol. 10:39-48, 1980; Maniatis, In: Cell Biology: A
Comprehensive Treatise, Vol. 3, Gene Sequence Expression, Academic Press, NY, pp.
563-608, 1980).

Once the vector or nucleic acid molecule containing the construct(s) has been
prepared for expression, the DNA construct(s) may be introduced into an appropriate host
cell by any of a variety of suitable means, such as transformation, transfection, conjugation,
protoplast fusion, electroporation, particle gun technology, calcium phosphate-precipitation,
direct microinjection, and the like. After the introduction of the vector,
recipient cells are grown in a selective medium, which selects for the growth of
vector-containing cells. Expression of the cloned gene(s) results in the production of a kinase of
the invention, or fragments thereof. This can take place in the transformed cells as such,
or following the induction of these cells to differentiate (for example, by administration of
bromodeoxyuracil to neuroblastoma cells or the like). A variety of incubation conditions
can be used to form the peptide of the present invention. The most preferred conditions
are those which mimic physiological conditions.

IV. The Proteins of the Invention

A variety of methodologies known in the art can be utilized to obtain the
polypeptides of the present invention. The polypeptides may be purified from tissues or
cells that naturally produce the polypeptides. Alternatively, the above-described isolated
nucleic acid fragments could be used to express the kinases of the invention in any
organism. The samples of the present invention include cells, protein extracts or
membrane extracts of cells, or biological fluids. The samples will vary based on the assay
format, the detection method, and the nature of the tissues, cells or extracts used as the
sample.

Any eukaryotic organism can be used as a source for the polypeptides of the
invention, as long as the source organism naturally contains such polypeptides. As used
herein, "source organism" refers to the original organism from which the amino acid
sequence of the subunit is derived, regardless of the organism the subunit is expressed in
and ultimately isolated from.

One skilled in the art can readily follow known methods for isolating proteins in
order to obtain the polypeptides free of natural contaminants. These include, but are not
limited to: size-exclusion chromatography, HPLC, ion-exchange chromatography, and
immuno-affinity chromatography.

Further, the polypeptides of the invention include the full-length polypeptides that
can be identified from the full-length or partial sequences encoded by SEQ ID NO:122,
SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ ID NO:127,
SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131, SEQ ID NO:132,
SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136, SEQ ID NO:137,
SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141, SEQ ID NO:142,
SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146, SEQ ID NO:147,
SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151, SEQ ID NO:152,
SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156, SEQ ID NO:157,
SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161, SEQ ID NO:162,
SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166, SEQ ID NO:167,
SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171, SEQ ID NO:172,
SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176, SEQ ID NO:177,
SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181, SEQ ID NO:182,
SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186, SEQ ID NO:187,
SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191, SEQ ID NO:192,
SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196, SEQ ID NO:197,
SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201, SEQ ID NO:202,
SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206, SEQ ID NO:207,
SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211, SEQ ID NO:212,
SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID NO:217,
SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID NO:222,
SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID NO:227,
SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231, SEQ ID NO:232,
SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236, SEQ ID NO:237,
SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241, and SEQ ID
NO:242 (Figure 1). In addition, the polypeptides of the invention include the domains of
these polypeptides, including, but not limited to, the N-terminal, kinase/catalytic, and
C-terminal domains.

The characteristics of the protein kinase nucleic acid sequences of the invention are
provided in Table 1. The protein kinases fall into 10 known groups: AGC, CAMK, CKI,
CMGC, dsPK, EIFK, LIMK, MLK, STE and TK. In addition, there are a significant
number of protein kinases that do not belong to any of the known groups, and therefore
presumably define new protein kinase groups.

Additional characteristics are shown in, inter alia, the tables, namely Table 1, 
Table 2, Table 3 and Table 4, provided below. 



V. Antibodies, Hybridomas, Methods of Use and Kits for Detection of Protein
Kinases

The present invention relates to an antibody having binding affinity to a kinase of
the invention. The polypeptide may have an amino acid sequence selected from the group
consisting of those set forth in SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124,
SEQ ID NO:125, SEQ ID NO:126, SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129,
SEQ ID NO:130, SEQ ID NO:131, SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134,
SEQ ID NO:135, SEQ ID NO:136, SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139,
SEQ ID NO:140, SEQ ID NO:141, SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144,
SEQ ID NO:145, SEQ ID NO:146, SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149,
SEQ ID NO:150, SEQ ID NO:151, SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154,
SEQ ID NO:155, SEQ ID NO:156, SEQ ID NO:157, SEQ ID NO:158, SEQ ID NO:159,
SEQ ID NO:160, SEQ ID NO:161, SEQ ID NO:162, SEQ ID NO:163, SEQ ID NO:164,
SEQ ID NO:165, SEQ ID NO:166, SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169,
SEQ ID NO:170, SEQ ID NO:171, SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174,
SEQ ID NO:175, SEQ ID NO:176, SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179,
SEQ ID NO:180, SEQ ID NO:181, SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184,
SEQ ID NO:185, SEQ ID NO:186, SEQ ID NO:187, SEQ ID NO:188, SEQ ID NO:189,
SEQ ID NO:190, SEQ ID NO:191, SEQ ID NO:199, SEQ ID NO:193, SEQ ID NO:194,
SEQ ID NO:195, SEQ ID NO:196, SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199,
SEQ ID NO:200, SEQ ID NO:201, SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204,
SEQ ID NO:205, SEQ ID NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209,
SEQ ID NO:210, SEQ ID NO:211, SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214,
SEQ ID NO:215, SEQ ID NO:216, SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219,
SEQ ID NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224,
SEQ ID NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229,
SEQ ID NO:230, SEQ ID NO:231, SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234,
SEQ ID NO:235, SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239,
SEQ ID NO:240, SEQ ID NO:241, and SEQ ID NO:242, or a functional derivative thereof, or
at least 9 contiguous amino acids thereof (preferably, at least 20, 30, 35, or 40 or more
contiguous amino acids thereof). Alternatively, the antibody may bind to a part of the
polypeptide not provided in the sequences above, but that is present in the full-length
sequence of the polypeptide and that is easily obtained using methods standard in the art.
Further, the antibody may bind specifically to particular domains of one or more of the
kinases of the invention, including, but not limited to, the N-terminal, kinase/catalytic, or
C-terminal domains.
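
By way of illustration only, the candidate fragments of "at least 9 contiguous amino
acids" referred to above can be enumerated with a simple sliding window. The following
Python sketch is not part of the disclosed methods: the function name
contiguous_fragments and the example sequence are hypothetical, and only the default
window of 9 residues is taken from the minimum length stated above.

```python
def contiguous_fragments(sequence, length=9):
    """Return (start, fragment) pairs for every window of `length` residues.

    `sequence` is a one-letter amino acid string; `start` is the 1-based
    position of the first residue of the fragment. Both the function name
    and this interface are illustrative assumptions, not taken from the
    specification.
    """
    if length < 1:
        raise ValueError("length must be a positive integer")
    seq = sequence.upper()
    # A sequence shorter than `length` yields an empty list.
    return [(i + 1, seq[i:i + length]) for i in range(len(seq) - length + 1)]


# Hypothetical 12-residue sequence used only to exercise the function.
example_fragments = contiguous_fragments("MKRHEDLASTVQ")
```

Passing length=20, 30, 35 or 40 gives the longer, preferred fragment lengths named in
the text.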

The present invention also relates to an antibody having specific binding affinity to 
a kinase or kinase domain of the invention. Such an antibody may be isolated by 
comparing its binding affinity to a kinase of the invention with its binding affinity to other
polypeptides. Those that bind selectively to a kinase of the invention would be chosen for
use in methods requiring a distinction between a kinase of the invention and other
polypeptides. Such methods could include, but should not be limited to, the analysis of
altered kinase expression in tissue containing other polypeptides. 
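
As a minimal sketch of the comparison described above, candidate antibodies can be
ranked by the ratio of their binding signal for the kinase to their strongest signal for
any other polypeptide. The Python below is illustrative only: the function name
select_specific, the data layout, the example values and the ten-fold cutoff are all
assumptions and are not taken from this specification.

```python
def select_specific(binding, target, min_ratio=10.0):
    """Return the antibodies whose binding is selective for `target`.

    `binding` maps antibody name -> {polypeptide name: measured signal}.
    An antibody is kept when its signal for `target` is at least
    `min_ratio` times its strongest signal for any other polypeptide.
    The cutoff is a hypothetical value chosen for illustration.
    """
    chosen = []
    for antibody, signals in binding.items():
        target_signal = signals[target]
        off_target = [v for name, v in signals.items() if name != target]
        strongest_off = max(off_target, default=0.0)
        if strongest_off == 0:
            selective = target_signal > 0
        else:
            selective = target_signal / strongest_off >= min_ratio
        if selective:
            chosen.append(antibody)
    return chosen


# Hypothetical signals (arbitrary units) for two candidate antibodies.
example_binding = {
    "ab1": {"kinase": 2.0, "albumin": 0.1, "other_kinase": 0.05},
    "ab2": {"kinase": 1.5, "albumin": 1.2},
}
specific = select_specific(example_binding, target="kinase")
```

In practice the signals would come from whichever binding assay is used, and the
cutoff would be set by the requirements of the method in which the antibody is used.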

The kinases of the present invention can be used in a variety of procedures and
methods, such as for the generation of antibodies, for use in identifying pharmaceutical
compositions, and for studying DNA/protein interaction.

The kinases of the present invention can be used to produce antibodies or
hybridomas. One skilled in the art will recognize that if an antibody is desired, such a
peptide could be generated as described herein and used as an immunogen. The antibodies
of the present invention include monoclonal and polyclonal antibodies, as well as fragments
of these antibodies, and humanized forms. Humanized forms of the antibodies of the
present invention may be generated using one of the procedures known in the art such as
chimerization or CDR grafting.

The present invention also relates to a hybridoma that produces the
above-described monoclonal antibody, or binding fragment thereof. A hybridoma is an
immortalized cell line that is capable of secreting a specific monoclonal antibody.

In general, techniques for preparing monoclonal antibodies and hybridomas are
well known in the art (Campbell, "Monoclonal Antibody Technology: Laboratory
Techniques in Biochemistry and Molecular Biology," Elsevier Science Publishers,
Amsterdam, The Netherlands, 1984; St. Groth et al., J. Immunol. Methods 35:1-21, 1980).
Any animal (mouse, rabbit, and the like) which is known to produce antibodies can be
immunized with the selected polypeptide. Methods for immunization are well known in
the art. Such methods include subcutaneous or intraperitoneal injection of the
polypeptide. One skilled in the art will recognize that the amount of polypeptide used for
immunization will vary based on the animal that is immunized, the antigenicity of the
polypeptide and the site of injection.

The polypeptide may be modified or administered in an adjuvant in order to
increase the peptide antigenicity. Methods of increasing the antigenicity of a polypeptide
are well known in the art. Such procedures include coupling the antigen with a
heterologous protein (such as globulin or β-galactosidase) or through the inclusion of an
adjuvant during immunization.

For monoclonal antibodies, spleen cells from the immunized animals are removed,
fused with myeloma cells, such as SP2/0-Ag14 myeloma cells, and allowed to become
monoclonal antibody producing hybridoma cells. Any one of a number of methods well
known in the art can be used to identify the hybridoma cell that produces an antibody with
the desired characteristics. These include screening the hybridomas with an ELISA assay,
western blot analysis, or radioimmunoassay (Lutz et al., Exp. Cell Res. 175:109-124,
1988). Hybridomas secreting the desired antibodies are cloned and the class and subclass
are determined using procedures known in the art (Campbell, "Monoclonal Antibody
Technology: Laboratory Techniques in Biochemistry and Molecular Biology", supra,
1984).

For polyclonal antibodies, antibody-containing antiserum is isolated from the
immunized animal and is screened for the presence of antibodies with the desired
specificity using one of the above-described procedures. The above-described antibodies
may be detectably labeled. Antibodies can be detectably labeled through the use of
radioisotopes, affinity labels (such as biotin, avidin, and the like), enzymatic labels (such
as horseradish peroxidase, alkaline phosphatase, and the like), fluorescent labels (such as
FITC or rhodamine, and the like), paramagnetic atoms, and the like. Procedures for
accomplishing such labeling are well known in the art; for example, see Sternberger et al.,
J. Histochem. Cytochem. 18:315, 1970; Bayer et al., Meth. Enzym. 62:308-, 1979; Engval
et al., Immunol. 109:129-, 1972; Goding, J. Immunol. Meth. 13:215-, 1976. The labeled
antibodies of the present invention can be used for in vitro, in vivo, and in situ assays to
identify cells or tissues that express a specific peptide.

The above-described antibodies may also be immobilized on a solid support.
Examples of such solid supports include plastics such as polycarbonate, complex
carbohydrates such as agarose and sepharose, and acrylic resins such as polyacrylamide
and latex beads. Techniques for coupling antibodies to such solid supports are well known
in the art (Weir et al., "Handbook of Experimental Immunology" 4th Ed., Blackwell
Scientific Publications, Oxford, England, Chapter 10, 1986; Jacoby et al., Meth. Enzym.
34, Academic Press, N.Y., 1974). The immobilized antibodies of the present invention
can be used for in vitro, in vivo, and in situ assays as well as in immunochromatography.

Furthermore, one skilled in the art can readily adapt currently available procedures,
as well as the techniques, methods and kits disclosed herein with regard to antibodies, to
generate peptides capable of binding to a specific peptide sequence in order to generate
rationally designed antipeptide peptides (Hurby et al., "Application of Synthetic Peptides:
Antisense Peptides", In Synthetic Peptides, A User's Guide, W.H. Freeman, NY, pp. 289-307,
1992; Kaspczak et al., Biochemistry 28:9230-9238, 1989).

Anti-peptide peptides can be generated by replacing the basic amino acid residues
found in the peptide sequences of the kinases of the invention with acidic residues, while
maintaining hydrophobic and uncharged polar groups. For example, lysine, arginine,
and/or histidine residues are replaced with aspartic acid or glutamic acid and glutamic acid
residues are replaced by lysine, arginine or histidine.
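
The substitution rule just described can be sketched in a few lines of Python. This is
an illustration under stated assumptions rather than a definitive procedure: the function
name anti_peptide, the default replacement residues (aspartic acid for basic residues,
lysine for glutamic acid) and the example sequence are hypothetical choices, and aspartic
acid residues in the input are left unchanged because the text above states no rule for
them.

```python
BASIC = {"K", "R", "H"}   # lysine, arginine, histidine
ACIDIC = {"D", "E"}       # aspartic acid, glutamic acid


def anti_peptide(sequence, acidic="D", basic="K"):
    """Apply the charge-swapping rule to a one-letter amino acid sequence.

    Basic residues (K, R, H) are replaced with `acidic` (D or E) and
    glutamic acid (E) is replaced with `basic` (K, R or H). All other
    residues, including hydrophobic and uncharged polar ones, are kept.
    The defaults are illustrative assumptions.
    """
    if acidic not in ACIDIC:
        raise ValueError("acidic replacement must be D or E")
    if basic not in BASIC:
        raise ValueError("basic replacement must be K, R or H")
    out = []
    for residue in sequence.upper():
        if residue in BASIC:
            out.append(acidic)
        elif residue == "E":
            out.append(basic)
        else:
            out.append(residue)
    return "".join(out)


# Hypothetical input used only to exercise the rule.
example_anti = anti_peptide("MKRHEDL")
```

Because the text allows either acidic residue and any of the three basic residues as
replacements, the two keyword arguments let a user choose among them.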

The present invention also encompasses a method of detecting a kinase
polypeptide in a sample, comprising: (a) contacting the sample with an above-described
antibody, under conditions such that immunocomplexes form, and (b) detecting the
presence of said antibody bound to the polypeptide. In detail, the methods comprise
incubating a test sample with one or more of the antibodies of the present invention and
assaying whether the antibody binds to the test sample. Altered levels of a kinase of the
invention in a sample as compared to normal levels may indicate disease.

Conditions for incubating an antibody with a test sample vary. Incubation
conditions depend on the format employed in the assay, the detection methods employed,
and the type and nature of the antibody used in the assay. One skilled in the art will
recognize that any one of the commonly available immunological assay formats (such as
radioimmunoassays, enzyme-linked immunosorbent assays, diffusion-based Ouchterlony,
or rocket immunofluorescent assays) can readily be adapted to employ the antibodies of
the present invention. Examples of such assays can be found in Chard ("An Introduction
to Radioimmunoassay and Related Techniques" Elsevier Science Publishers, Amsterdam,
The Netherlands, 1986), Bullock et al. ("Techniques in Immunocytochemistry," Academic
Press, Orlando, FL Vol. 1, 1982; Vol. 2, 1983; Vol. 3, 1985), and Tijssen ("Practice and
Theory of Enzyme Immunoassays: Laboratory Techniques in Biochemistry and Molecular
Biology," Elsevier Science Publishers, Amsterdam, The Netherlands, 1985).

The immunological assay test samples of the present invention include cells,
protein or membrane extracts of cells, or biological fluids such as blood, serum, plasma, or
urine. The test samples used in the above-described method will vary based on the assay
format, nature of the detection method and the tissues, cells or extracts used as the sample
to be assayed. Methods for preparing protein extracts or membrane extracts of cells are
well known in the art and can readily be adapted in order to obtain a sample which is
testable with the system utilized.

A kit contains all the necessary reagents to carry out the previously described
methods of detection. The kit may comprise: (i) a first container means containing an
above-described antibody, and (ii) a second container means containing a conjugate
comprising a binding partner of the antibody and a label. In another preferred
embodiment, the kit further comprises one or more other containers comprising one or
more of the following: wash reagents and reagents capable of detecting the presence of
bound antibodies.

Examples of detection reagents include, but are not limited to, labeled secondary
antibodies, or in the alternative, if the primary antibody is labeled, the chromophoric,
enzymatic, or antibody binding reagents that are capable of reacting with the labeled
antibody. The compartmentalized kit may be as described above for nucleic acid probe
kits. One skilled in the art will readily recognize that the antibodies described in the
present invention can readily be incorporated into one of the established kit formats that
are well known in the art.

VI. Isolation of Compounds That Interact With Protein Kinases 

The present invention also relates to a method of detecting a compound capable of
binding to a protein kinase of the invention, comprising incubating the compound with a
kinase of the invention and detecting the presence of the compound bound to the kinase.
The compound may be present within a complex mixture, for example, serum, body fluid,
or cell extracts.

The present invention also relates to a method of detecting an agonist or antagonist
of kinase activity or kinase binding partner activity comprising incubating cells that
produce a kinase of the invention in the presence of a compound and detecting changes in
the level of kinase activity or kinase binding partner activity. The compounds thus
identified would produce a change in activity indicative of the presence of the compound.
The compound may be present within a complex mixture, for example, serum, body fluid,
or cell extracts. Once the compound is identified it can be isolated using techniques well
known in the art.

The present invention also encompasses a method of agonizing (stimulating) or
antagonizing kinase associated activity in a mammal comprising administering to said
mammal an agonist or antagonist to a kinase of the invention in an amount sufficient to
effect said agonism or antagonism. A method of treating diseases in a mammal with an
agonist or antagonist of kinase activity comprising administering the agonist or antagonist
to a mammal in an amount sufficient to agonize or antagonize kinase associated functions
is also encompassed in the present application.

In an effort to discover novel treatments for diseases, biomedical researchers and
chemists have designed, synthesized, and tested molecules that inhibit the function of
protein kinases. Some small organic molecules form a class of compounds that modulate
the function of protein kinases. Examples of molecules that have been reported to inhibit
the function of protein kinases include, but are not limited to, bis monocyclic, bicyclic or
heterocyclic aryl compounds (PCT WO 92/20642, published November 26, 1992 by
Maguire et al.), vinylene-azaindole derivatives (PCT WO 94/14808, published July 7,
1994 by Ballinari et al.), 1-cyclopropyl-4-pyridyl-quinolones (U.S. Patent No. 5,330,992),
styryl compounds (U.S. Patent No. 5,217,999), styryl-substituted pyridyl compounds (U.S.
Patent No. 5,302,606), certain quinazoline derivatives (EP Application No. 0 566 266 A1),
seleoindoles and selenides (PCT WO 94/03427, published February 17, 1994 by Denny et
al.), tricyclic polyhydroxylic compounds (PCT WO 92/21660, published December 10,
1992 by Dow), and benzylphosphonic acid compounds (PCT WO 91/15495, published
October 17, 1991 by Dow et al.).

Compounds that can traverse cell membranes and are resistant to acid hydrolysis
are potentially advantageous as therapeutics as they can become highly bioavailable after
being administered orally to patients. However, many of these protein kinase inhibitors
only weakly inhibit the function of protein kinases. In addition, many inhibit a variety of
protein kinases and will cause multiple side-effects as therapeutics for diseases.

Some indolinone compounds, however, form classes of acid resistant and
membrane permeable organic molecules. WO 96/22976 (published August 1, 1996 by
Ballinari et al.) describes hydrosoluble indolinone compounds that harbor tetralin,
naphthalene, quinoline, and indole substituents fused to the oxindole ring. These bicyclic
substituents are in turn substituted with polar moieties including hydroxylated alkyl,
phosphate, and ether moieties. U.S. Patent Application Serial Nos. 08/702,232, filed
August 23, 1996, entitled "Indolinone Combinatorial Libraries and Related Products and
Methods for the Treatment of Disease" by Tang et al. (Lyon & Lyon Docket No. 221/187)
and 08/485,323, filed June 7, 1995, entitled "Benzylidene-Z-Indoline Compounds for the
Treatment of Disease" by Tang et al. (Lyon & Lyon Docket No. 223/298) and
International Patent Publication WO 96/22976, published August 1, 1996 by Ballinari et
al., all of which are incorporated herein by reference in their entirety, including any
drawings, describe indolinone chemical libraries of indolinone compounds harboring other
bicyclic moieties as well as monocyclic moieties fused to the oxindole ring. Applications
08/702,232, filed August 23, 1996, entitled "Indolinone Combinatorial Libraries and
Related Products and Methods for the Treatment of Disease" by Tang et al. (Lyon & Lyon
Docket No. 221/187), 08/485,323, filed June 7, 1995, entitled "Benzylidene-Z-Indoline
Compounds for the Treatment of Disease" by Tang et al. (Lyon & Lyon Docket No.
223/298), and WO 96/22976, published August 1, 1996 by Ballinari et al. teach methods
of indolinone synthesis, methods of testing the biological activity of indolinone
compounds in cells, and inhibition patterns of indolinone derivatives.

Other examples of substances capable of modulating kinase activity include, but
are not limited to, tyrphostins, quinazolines, quinoxolines, and quinolines. The
quinazolines, tyrphostins, quinolines, and quinoxolines referred to above include well
known compounds such as those described in the literature. For example, representative
publications describing quinazolines include Barker et al., EPO Publication No. 0 520 722
A1; Jones et al., U.S. Patent No. 4,447,608; Kabbe et al., U.S. Patent No. 4,757,072; Kaul
and Vougioukas, U.S. Patent No. 5,316,553; Kreighbaum and Comer, U.S. Patent No.
4,343,940; Pegg and Wardleworth, EPO Publication No. 0 562 734 A1; Barker et al.,
Proc. of Am. Assoc. for Cancer Research 32:327 (1991); Bertino, J.R., Cancer Research
3:293-304 (1979); Bertino, J.R., Cancer Research 9(2 part 1):293-304 (1979); Curtin et
al., Br. J. Cancer 53:361-368 (1986); Fernandes et al., Cancer Research 43:1117-1123
(1983); Ferris et al., J. Org. Chem. 44(2):173-178; Fry et al., Science 265:1093-1095
(1994); Jackman et al., Cancer Research 51:5579-5586 (1981); Jones et al., J. Med. Chem.
29(6):1114-1118; Lee and Skibo, Biochemistry 26(23):7355-7362 (1987); Lemus et al., J.
Org. Chem. 54:3511-3518 (1989); Ley and Seng, Synthesis 1975:415-522 (1975);
Maxwell et al., Magnetic Resonance in Medicine 17:189-196 (1991); Mini et al., Cancer
Research 45:325-330 (1985); Phillips and Castle, J. Heterocyclic Chem. 17(19):1489-1596
(1980); Reece et al., Cancer Research 47(11):2996-2999 (1977); Sculier et al., Cancer
Immunol. and Immunother. 23:A65 (1986); Sikora et al., Cancer Letters 23:289-295
(1984); Sikora et al., Analytical Biochem. 172:344-355 (1988); all of which are
incorporated herein by reference in their entirety, including any drawings.

Quinoxaline is described in Kaul and Vougioukas, U.S. Patent No. 5,316,553, 
incorporated herein by reference in its entirety, including any drawings. 

Quinolines are described in Dolle et al., J. Med. Chem. 37:2627-2629 (1994);
MaGuire, J. Med. Chem. 37:2129-2131 (1994); Burke et al., J. Med. Chem. 36:425-432
(1993); and Burke et al., BioOrganic Med. Chem. Letters 2:1771-1774 (1992), all of which
are incorporated by reference in their entirety, including any drawings.

Tyrphostins are described in Allen et al., Clin. Exp. Immunol. 91:141-156 (1993);
Anafi et al., Blood 82:12:3524-3529 (1993); Baker et al., J. Cell Sci. 102:543-555 (1992);
Bilder et al., Amer. Physiol. Soc. pp. 6363-6143:C721-C730 (1991); Brunton et al.,
Proceedings of Amer. Assoc. Cancer Rsch. 33:558 (1992); Bryckaert et al., Experimental
Cell Research 199:255-261 (1992); Dong et al., J. Leukocyte Biology 53:53-60 (1993);
Dong et al., J. Immunol. 151(5):2717-2724 (1993); Gazit et al., J. Med. Chem. 32:2344-2352
(1989); Gazit et al., J. Med. Chem. 36:3556-3564 (1993); Kaur et al., Anti-Cancer
Drugs 5:213-222 (1994); Kaur et al., King et al., Biochem. J. 275:413-418 (1991); Kuo et
al., Cancer Letters 74:197-202 (1993); Levitzki, A., The FASEB J. 6:3275-3282 (1992);
Lyall et al., J. Biol. Chem. 264:14503-14509 (1989); Peterson et al., The Prostate 22:335-345
(1993); Pillemer et al., Int. J. Cancer 50:80-85 (1992); Posner et al., Molecular
Pharmacology 45:673-683 (1993); Rendu et al., Biol. Pharmacology 44(5):881-888
(1992); Sauro and Thomas, Life Sciences 53:371-376 (1993); Sauro and Thomas, J.
Pharm. and Experimental Therapeutics 267(3):119-1125 (1993); Wolbring et al., J. Biol.
Chem. 269(36):22470-22472 (1994); and Yoneda et al., Cancer Research 51:4430-4435
(1991); all of which are incorporated herein by reference in their entirety, including any
drawings.

Other compounds that could be used as modulators include oxindolinones such as
those described in U.S. patent application Serial No. 08/702,232 filed August 23, 1996,
incorporated herein by reference in its entirety, including any drawings.

VII. Biological Significance, Applications and Clinical Relevance of Novel Protein
Kinases

For each protein kinase in this application, we provide a classification of the
protein class and family to which it belongs, a summary of non-catalytic protein motifs, a
profile of its expression in several hundred tissue and cell sources, and a chromosomal
location. This information can be used to suggest potential function, regulation or
therapeutic utility for each of the proteins.

The kinase classification and protein domains often reflect pathways, cellular roles,
or mechanisms of up- or down-stream regulation. Also, disease-relevant genes often occur
in families of related genes. For example, if one member of a kinase family functions as an
oncogene or a tumor suppressor, or has been found to be disrupted in an immune,
neurologic, cardiovascular, or metabolic disorder, other family members frequently
play a related role.

The expression analysis organizes kinases into groups that are transcriptionally
upregulated in tumors and those that are more restricted to specific tumor types such as
melanoma or prostate. This analysis also identifies genes that are regulated in a cell cycle
dependent manner, and are therefore likely to be involved in maintaining cell cycle
checkpoints; in entry into, progression through, or exit from mitosis; in overseeing DNA
repair; or in cell proliferation and genome stability. Expression data also can identify genes
expressed in endothelial sources or other tissues that suggest a role in angiogenesis,
thereby implicating them as targets for control of diseases that have an angiogenic
component, such as cancer, endometriosis, retinopathy and macular degeneration, and
various ischemic or vascular pathologies. A protein's role in cell survival can also be
suggested based on restricted expression in cells subjected to external stress such as
oxidative damage, hypoxia, drugs such as cisplatinum, or irradiation.
Metastases-associated genes can be implicated when expression is restricted to invading
regions of a tumor, or is only seen in local or distant metastases compared to the primary
tumor, or when a gene is upregulated during cell culture models of invasion, migration,
or motility.

Chromosomal location can identify candidate targets for a tumor amplicon or a
tumor-suppressor locus. Summaries of prevalent tumor amplicons are available in the
literature, and can identify tumor types that should be tested experimentally for
amplified copies of a kinase gene that localizes to an adjacent region.

Based on these criteria several kinases immediately stand out as being of potential
therapeutic relevance. The protein kinases can be divided into the following
disease-relevant categories (nucleotide SEQ ID NOs in parentheses):

Tumor associated: Mok (SEQ ID NO:57), EPK2, AA316804 (SEQ ID
NO:11), AA435956 (SEQ ID NO:48), AA278842 (SEQ ID NO:88), AA599286 (SEQ
ID NO:89), AA826850 (SEQ ID NO:3), HRI (SEQ ID NO:73), MLK4 AA232253 (SEQ
ID NO:82), AA883975 SGK235 (SEQ ID NO:95), AA311714 (SEQ ID NO:101),
MPSK1 (SEQ ID NO:110), R19609 (SEQ ID NO:111), AA383293 (SEQ ID NO:26).

Prostate-specific: AA234451 (SEQ ID NO:47), TSK4 (SEQ ID NO:93), RIP4
(SEQ ID NO:84), KIAA0965 (SEQ ID NO:8).

Oncogenic or proliferation associated: KIAA0781 (SEQ ID NO:38), AA789239
(SEQ ID NO:52), CCRK (SEQ ID NO:54), CLK4 (SEQ ID NO:55), H85389 (SEQ ID
NO:97).

Neuronal restricted: CAMKKB (SEQ ID NO:66)

Hematopoietic expressed: PTK9L (SEQ ID NO:22), DRAK2 (SEQ ID NO:29),
AI025291 (SEQ ID NO:94)

Angiogenic or endothelial expressed: DRAK1 (SEQ ID NO:31), MAK-V (SEQ
ID NO:40), TRAD (SEQ ID NO:44), MOK (SEQ ID NO:57), AA08847 (SEQ ID NO:78),
HGP_:66444466 (SEQ ID NO:79), RSK4 (SEQ ID NO:16).

Cell cycle regulated: AA454060 (SEQ ID NO:45), KIAA0999 (Mitotic - SEQ ID
NO:32), AA579641 (Mitotic - SEQ ID NO:60), AA305176 (Mitotic - SEQ ID NO:6),
AA018361 (SI phase - SEQ ID NO:100).

VIII. Transgenic Animals

A variety of methods are available for the production of transgenic animals
associated with this invention. DNA can be injected into the pronucleus of a fertilized egg
before fusion of the male and female pronuclei, or injected into the nucleus of an
embryonic cell (e.g., the nucleus of a two-cell embryo) following the initiation of cell
division (Brinster et al., Proc. Nat. Acad. Sci. USA 82: 4438-4442, 1985). Embryos can
be infected with viruses, especially retroviruses, modified to carry inorganic-ion receptor
nucleotide sequences of the invention.

Pluripotent stem cells derived from the inner cell mass of the embryo and
stabilized in culture can be manipulated in culture to incorporate nucleotide sequences of
the invention. A transgenic animal can be produced from such cells through implantation
into a blastocyst that is implanted into a foster mother and allowed to come to term.
Animals suitable for transgenic experiments can be obtained from standard commercial
sources such as Charles River (Wilmington, MA), Taconic (Germantown, NY), Harlan
Sprague Dawley (Indianapolis, IN), etc.

The procedures for manipulation of the rodent embryo and for microinjection of
DNA into the pronucleus of the zygote are well known to those of ordinary skill in the art
(Hogan et al., supra). Microinjection procedures for fish, amphibian eggs and birds are
detailed in Houdebine and Chourrout (Experientia 47: 897-905, 1991). Other procedures
for introduction of DNA into tissues of animals are described in U.S. Patent No.
4,945,050 (Sanford et al., July 30, 1990).

By way of example only, to prepare a transgenic mouse, female mice are induced
to superovulate. Females are placed with males, and the mated females are sacrificed by
CO2 asphyxiation or cervical dislocation and embryos are recovered from excised
oviducts. Surrounding cumulus cells are removed. Pronuclear embryos are then washed
and stored until the time of injection. Randomly cycling adult female mice are paired with
vasectomized males. Recipient females are mated at the same time as donor females.
Embryos then are transferred surgically. The procedure for generating transgenic rats is
similar to that of mice (Hammer et al., Cell 63:1099-1112, 1990).

Methods for the culturing of embryonic stem (ES) cells and the subsequent
production of transgenic animals by the introduction of DNA into ES cells using methods
such as electroporation, calcium phosphate/DNA precipitation and direct injection also are
well known to those of ordinary skill in the art (Teratocarcinomas and Embryonic Stem
Cells, A Practical Approach, E.J. Robertson, ed., IRL Press, 1987).

In cases involving random gene integration, a clone containing the sequence(s) of
the invention is co-transfected with a gene encoding resistance. Alternatively, the gene
encoding neomycin resistance is physically linked to the sequence(s) of the invention.
Transfection and isolation of desired clones are carried out by any one of several methods
well known to those of ordinary skill in the art (E.J. Robertson, supra).

DNA molecules introduced into ES cells can also be integrated into the
chromosome through the process of homologous recombination (Capecchi, Science 244:
1288-1292, 1989). Methods for positive selection of the recombination event (i.e., neo
resistance) and dual positive-negative selection (i.e., neo resistance and gancyclovir
resistance) and the subsequent identification of the desired clones by PCR have been
described by Capecchi, supra and Joyner et al. (Nature 338: 153-156, 1989), the teachings
of which are incorporated herein in their entirety including any drawings. The final phase
of the procedure is to inject targeted ES cells into blastocysts and to transfer the
blastocysts into pseudopregnant females. The resulting chimeric animals are bred and the
offspring are analyzed by Southern blotting to identify individuals that carry the transgene.
Procedures for the production of non-rodent mammals and other animals have been
discussed by others (Houdebine and Chourrout, supra; Pursel et al., Science 244:1281-
1288, 1989; and Simms et al., Bio/Technology 6:179-183, 1988).

Thus, the invention provides transgenic, nonhuman mammals containing a
transgene encoding a kinase of the invention or a gene effecting the expression of the
kinase. Such transgenic nonhuman mammals are particularly useful as an in vivo test
system for studying the effects of introduction of a kinase, or regulating the expression of
a kinase (i.e., through the introduction of additional genes, antisense nucleic acids, or
ribozymes).

A "transgenic animal" is an animal having cells that contain DNA which has been 
artificially inserted into a cell, which DNA becomes part of the genome of the animal 
which develops from that cell. Preferred transgenic animals are primates, mice, rats, 
cows, pigs, horses, goats, sheep, dogs and cats. The transgenic DNA may encode human 
STE20-related kinases. Native expression in an animal may be reduced by providing an 
amount of anti-sense RNA or DNA effective to reduce expression of the receptor. 

IX. Gene Therapy 

Protein kinases of the invention, or their genetic sequences, will also be useful in 
gene therapy (reviewed in Miller, Nature 357:455-460, 1992). Miller states that advances 
have resulted in practical approaches to human gene therapy that have demonstrated 
positive initial results. The basic science of gene therapy is described in Mulligan 
(Science 260:926-931, 1993). 

In one preferred embodiment, an expression vector containing protein kinase 
coding sequence is inserted into cells, the cells are grown in vitro, and then are infused in 
large numbers into patients. In another preferred embodiment, a DNA segment containing 
a promoter of choice (for example a strong promoter) is transferred into cells containing 
an endogenous gene encoding kinases of the invention in such a manner that the promoter 
segment enhances expression of the endogenous kinase gene (for example, the promoter 
segment is transferred to the cell such that it becomes directly linked to the endogenous 
kinase gene). 

The gene therapy may involve the use of an adenovirus containing kinase cDNA 
targeted to a tumor, systemic kinase increase by implantation of engineered cells, injection 
with kinase-encoding virus, or injection of naked kinase DNA into appropriate tissues. 

Target cell populations may be modified by introducing altered forms of one or 
more components of the protein complexes in order to modulate the activity of such 
complexes. For example, by reducing or inhibiting a complex component activity within 
target cells, an abnormal signal transduction event(s) leading to a condition may be 
decreased, inhibited, or reversed. Deletion or missense mutants of a component that 
retain the ability to interact with other components of the protein complexes but cannot 
function in signal transduction may be used to inhibit an abnormal, deleterious signal 
transduction event. 

Expression vectors derived from viruses such as retroviruses, vaccinia virus, 
adenovirus, adeno-associated virus, herpes viruses, several RNA viruses, or bovine 
papilloma virus, may be used for delivery of nucleotide sequences (e.g., cDNA) encoding 
a recombinant kinase protein of the invention into the targeted cell population (e.g., tumor 
cells). Methods which are well known to those skilled in the art can be used to construct 
recombinant viral vectors containing coding sequences (Maniatis et al., Molecular 
Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory, N.Y., 1989; Ausubel et 
al., Current Protocols in Molecular Biology, Greene Publishing Associates and Wiley 
Interscience, N.Y., 1989). Alternatively, recombinant nucleic acid molecules encoding 
protein sequences can be used as naked DNA or in a reconstituted system, e.g., liposomes 
or other lipid systems, for delivery to target cells (e.g., Felgner et al., Nature 337:387-8, 
1989). Several other methods for the direct transfer of plasmid DNA into cells exist for 
use in human gene therapy and involve targeting the DNA to receptors on cells by 
complexing the plasmid DNA to proteins (Miller, supra). 

In its simplest form, gene transfer can be performed by simply injecting minute 
amounts of DNA into the nucleus of a cell, through a process of microinjection (Capecchi, 
Cell 22:479-88, 1980). Once recombinant genes are introduced into a cell, they can be 
recognized by the cell's normal mechanisms for transcription and translation, and a gene 
product will be expressed. Other methods have also been attempted for introducing DNA 
into larger numbers of cells. These methods include: transfection, wherein DNA is 
precipitated with CaPO4 and taken into cells by pinocytosis (Chen et al., Mol. Cell Biol. 
7:2745-52, 1987); electroporation, wherein cells are exposed to large voltage pulses to 
introduce holes into the membrane (Chu et al., Nucleic Acids Res. 15:1311-26, 1987); 
lipofection/liposome fusion, wherein DNA is packaged into lipophilic vesicles which fuse 
with a target cell (Felgner et al., Proc. Natl. Acad. Sci. USA 84:7413-7417, 1987); and 
particle bombardment using DNA bound to small projectiles (Yang et al., Proc. Natl. 
Acad. Sci. 87:9568-9572, 1990). Another method for introducing DNA into cells is to 
couple the DNA to chemically modified proteins. 

It has also been shown that adenovirus proteins are capable of destabilizing 
endosomes and enhancing the uptake of DNA into cells. The admixture of adenovirus to 
solutions containing DNA complexes, or the binding of DNA to polylysine covalently 
attached to adenovirus using protein crosslinking agents, substantially improves the uptake 
and expression of the recombinant gene (Curiel et al., Am. J. Respir. Cell. Mol. Biol. 
6:247-52, 1992). 

As used herein "gene transfer" means the process of introducing a foreign nucleic 
acid molecule into a cell. Gene transfer is commonly performed to enable the expression 
of a particular product encoded by the gene. The product may include a protein, 
polypeptide, anti-sense DNA or RNA, or enzymatically active RNA. Gene transfer can be 
performed in cultured cells or by direct administration into animals. Generally gene 
transfer involves the process of nucleic acid contact with a target cell by non-specific or 
receptor mediated interactions, uptake of nucleic acid into the cell through the membrane 
or by endocytosis, and release of nucleic acid into the cytoplasm from the plasma 
membrane or endosome. Expression may require, in addition, movement of the nucleic 
acid into the nucleus of the cell and binding to appropriate nuclear factors for 
transcription. 

As used herein "gene therapy" is a form of gene transfer and is included within the 
definition of gene transfer as used herein and specifically refers to gene transfer to express 
a therapeutic product from a cell in vivo or in vitro. Gene transfer can be performed ex 
vivo on cells which are then transplanted into a patient, or can be performed by direct 
administration of the nucleic acid or nucleic acid-protein complex into the patient. 

In another preferred embodiment, a vector having nucleic acid sequences encoding 
a protein kinase polypeptide of the invention is provided in which the nucleic acid 
sequence is expressed only in specific tissue. Methods of achieving tissue-specific gene 
expression are set forth in International Publication No. WO 93/09236, filed November 3, 
1992 and published May 13, 1993. 

In all of the preceding vectors set forth above, a further aspect of the invention is 
that the nucleic acid sequence contained in the vector may include additions, deletions or 
modifications to some or all of the sequence of the nucleic acid, as defined above. 

In another preferred embodiment, a method of gene replacement is set forth. 
"Gene replacement" as used herein means supplying a nucleic acid sequence which is 
capable of being expressed in vivo in an animal and thereby providing or augmenting the 
function of an endogenous gene that is missing or defective in the animal. 

X. Administration of Substances 

Methods of determining the dosages of compounds to be administered to a patient 
and modes of administering compounds to an organism are disclosed in U.S. Application 
Serial No. 08/702,282, filed August 23, 1996 and International patent publication number 
WO 96/22976, published August 1, 1996, both of which are incorporated herein by 
reference in their entirety, including any drawings, figures, or tables. Those skilled in the 
art will appreciate that such descriptions are applicable to the present invention and can be 
easily adapted to it. 

The proper dosage depends on various factors such as the type of disease being 
treated, the particular composition being used, and the size and physiological condition of 
the patient. Therapeutically effective doses for the compounds described herein can be 
estimated initially from cell culture and animal models. For example, a dose can be 
formulated in animal models to achieve a circulating concentration range that initially 
takes into account the IC50 as determined in cell culture assays. The animal model data 
can be used to more accurately determine useful doses in humans. 

Plasma half-life and biodistribution of the drug and metabolites in the plasma, 
tumors, and major organs can also be determined to facilitate the selection of drugs 
most appropriate to inhibit a disorder. For example, HPLC analysis can be performed 
on the plasma of animals treated with the drug, and the location of radiolabeled 
compounds can be determined using detection methods such as X-ray, CAT scan, and MRI. 
Compounds that show potent inhibitory activity in the screening assays, but have poor 
pharmacokinetic characteristics, can be optimized by altering the chemical structure 
and retesting. In this regard, compounds displaying good pharmacokinetic 
characteristics can be used as a model. 

Toxicity studies can also be carried out by measuring the blood cell composition. 
For example, toxicity studies can be carried out in a suitable animal model as follows: 1) 
the compound is administered to mice (an untreated control mouse should also be used); 
2) blood samples are periodically obtained via the tail vein from one mouse in each 
treatment group; and 3) the samples are analyzed for red and white blood cell counts, 
blood cell composition, and the percent of lymphocytes versus polymorphonuclear cells. 
A comparison of results for each dosing regime with the controls indicates if toxicity is 
present. 

At the termination of each toxicity study, further studies can be carried out by 
sacrificing the animals (preferably, in accordance with the American Veterinary Medical 
Association guidelines; Report of the American Veterinary Medical Assoc. Panel on 
Euthanasia, Journal of American Veterinary Medical Assoc., 202:229-249, 1993). 
Representative animals from each treatment group can then be examined by gross 
necropsy for immediate evidence of metastasis, unusual illness, or toxicity. Gross 
abnormalities in tissue are noted, and tissues are examined histologically. Compounds 
causing a reduction in body weight or blood components are less preferred, as are 
compounds having an adverse effect on major organs. In general, the greater the adverse 
effect, the less preferred the compound. 

For the treatment of cancers the expected daily dose of a hydrophobic 
pharmaceutical agent is between 1 and 500 mg/day, preferably 1 to 250 mg/day, and most 
preferably 1 to 50 mg/day. Drugs can be delivered less frequently provided plasma levels 
of the active moiety are sufficient to maintain therapeutic effectiveness. 

Plasma levels should reflect the potency of the drug. Generally, the more potent the 
compound, the lower the plasma levels necessary to achieve efficacy. 

EXAMPLES 

The examples below are not limiting and are merely representative of various 
aspects and features of the present invention. The examples below demonstrate the 
isolation and characterization of the protein kinases of the invention. 

EXAMPLE 1: Isolation of cDNA Clones Encoding Novel Mammalian Protein Kinases 

Materials and Methods 

Identification from cDNA databases and isolation of clones encoding novel protein kinases 

Novel kinases were identified from the public EST databases using a Hidden 
Markov model, abbreviated HMM (Krogh, A., Brown, M., Mian, I. S., Sjolander, K., and 
Haussler, D. 1994, Hidden Markov models in computational biology: Applications to 
protein modeling. J. Mol. Biol. 235:1501-1531). The model was built with 70 
mammalian and yeast kinase catalytic domain sequences. These sequences were chosen 
from a comprehensive collection of kinases such that no two sequences had more than 
50% sequence identity. ESTs were translated in six open reading frames and were 
searched against the model. ESTs that had a score of at least 10 against the HMM were 
then masked for repetitive sequences and vectors and were clustered using MSA. The 
resulting contigs were searched against known kinases to identify EST clones that encode 
novel kinases. 
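
The six-frame translation step mentioned above can be illustrated with a short sketch. This is only a minimal illustration, assuming the standard genetic code and plain-string input; the function names are hypothetical and are not taken from the software actually used in this work. 

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}
COMPLEMENT = str.maketrans("ACGTN", "TGCAN")


def reverse_complement(seq):
    """Return the reverse complement of a DNA string."""
    return seq.upper().translate(COMPLEMENT)[::-1]


def translate(seq):
    """Translate complete codons; codons with ambiguous bases become 'X'."""
    seq = seq.upper()
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq) - 2, 3)
    )


def six_frame_translations(seq):
    """Map frame labels (+1..+3, -1..-3) to conceptual translations."""
    rc = reverse_complement(seq)
    frames = {}
    for offset in range(3):
        frames[f"+{offset + 1}"] = translate(seq[offset:])
        frames[f"-{offset + 1}"] = translate(rc[offset:])
    return frames
```

Each of the six translations would then be scored against the profile; the scoring itself is not shown here. 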

Approximately 40% of the ESTs encoding potentially novel kinases did not 
correspond to the correct EST upon sequence analysis. Most of these discrepancies were 
resolved by ordering additional clones; however, 14 remained unavailable. These 14 ESTs 
were amplified from a variety of single-stranded cDNA sources with primers derived from 
the corresponding EST entry as shown in Table 5. The PCR product was subcloned into a 
Bluescript vector, digested to confirm the presence of a correct size insert and sequenced. 
Full sequencing of EST and PCR products was carried out using a cycle sequencing Big-dye kit 
with AmpliTaq DNA Polymerase, FS (ABI, Foster City, CA). Sequencing reaction 
products were run on an ABI Prism 377 DNA Sequencer. 

Table 5: Primers used to clone PCR products corresponding to novel kinases 

sp   ID# na   ID# aa   Parent Sequence        5' primer Sequence*        3' primer Sequence* 
H    33       153      2R22-5-11              GAGATCGRNTTYAARGARTTYGA    TGTCACNCCNAGNSWCCANAYRTT 
M    81       200      5R57_10_2_m TESK2_m    GCTGCTGGACAGTGACTTGTATTT   GAAAGCAAAGCCTTCACACCTT 
H    67       187      5R69_17_2_h            CTCTCACCTCAGGAACTGO        GCTTGCGGATCTTCTCA 
H    46       166      SGK309_h               GACATCCTGCCGGCCAACTACG     CGGCCCTGGAGCTGCATCACTA 
M    67       228      5R72_16.2_h            TGCGCGACACCATTGACCAG       CTCAGGGCTTACATACAGAG 
H    45       165      5R72_8_2_h             AAAGGAGAACTACATTTTGAAAAT   CrrCATCATCTCTAATACATTGGTTGG 
H    41       161      Z36720                 CAAATTAAGATCATTGACTTTGGG   GGAAACAAAGTCCTTGGCCTC 
H    115      234      AL031652-Pak6          GTGGACATCTGGTCCCTCG        GTAGGTCCTTCACTCTTGGAG 

* degenerate oligonucleotide residue designation: 
N = A, C, G or T 
R = A or G 
Y = C or T 
S = C or G 
W = A or T 
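
As an illustration of the degenerate residue designations given under Table 5, the sketch below counts and enumerates the concrete oligonucleotides represented by a degenerate primer. It is a minimal example using only the five codes listed in the footnote; the function names are hypothetical. 

```python
from itertools import product

# Degenerate residue codes as listed in the footnote to Table 5.
DEGENERATE = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "N": "ACGT",
    "R": "AG",
    "Y": "CT",
    "S": "CG",
    "W": "AT",
}


def degeneracy(primer):
    """Number of distinct sequences represented by a degenerate primer."""
    count = 1
    for base in primer.upper():
        count *= len(DEGENERATE[base])
    return count


def expand(primer):
    """List every concrete sequence encoded by a degenerate primer."""
    choices = (DEGENERATE[base] for base in primer.upper())
    return ["".join(combo) for combo in product(*choices)]
```

For the first 5' primer in Table 5, read as one continuous sequence (GAGATCGRNTTYAARGARTTYGA), `degeneracy` gives 2 x 4 x 2 x 2 x 2 x 2 = 128 distinct oligonucleotides. 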

Full-length sequence extension of protein kinases using cDNA and genomic databases 

Extension of partial cDNA sequences to encompass the full-length open-reading 
frame was carried out by iterative blastn searching of the cDNA databases listed in Table 
6. All blastn searches were conducted using a blosum62 matrix, a penalty for a nucleotide 
mismatch of -3 and reward for a nucleotide match of 1. The gapped blast algorithm is 
described in Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, Jinghui 
Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), "Gapped BLAST and 
PSI-BLAST: a new generation of protein database search programs", Nucleic Acids Res. 
25:3389-3402. 
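
To make the nucleotide scoring parameters concrete, the following sketch scores an ungapped alignment with a match reward of 1 and a mismatch penalty of -3, the values stated above. It is an illustration only and does not reproduce the gapped BLAST algorithm; the function names are hypothetical. 

```python
MATCH_REWARD = 1
MISMATCH_PENALTY = -3


def ungapped_score(a, b):
    """Score two equal-length nucleotide strings position by position."""
    if len(a) != len(b):
        raise ValueError("sequences must have the same length")
    return sum(
        MATCH_REWARD if x == y else MISMATCH_PENALTY
        for x, y in zip(a.upper(), b.upper())
    )


def best_ungapped_segment(a, b):
    """Highest-scoring contiguous run along the main diagonal.

    The running score is reset to zero whenever it would go negative,
    as in local alignment.
    """
    best = current = 0
    for x, y in zip(a.upper(), b.upper()):
        step = MATCH_REWARD if x == y else MISMATCH_PENALTY
        current = max(0, current + step)
        best = max(best, current)
    return best
```

With these values a segment scores positively only when more than 75% of its positions match, since p - 3(1 - p) > 0 requires p > 0.75. 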

Table 6. Databases used for cDNA-based sequence extensions 

Database               Database Date 
LifeGold templates     Feb 2000 
LifeGold compseqs      Feb 2000 
LifeGold compseqs      Mar 2000 
LifeGold compseqs      Apr 2000 
LifeGold fl            Feb 2000 
LifeGold flft          Apr 2000 
NCBI human ESTs        May 2000 
NCBI murine ESTs       May 2000 
NCBI nonredundant      May 2000 


Extension of partial cDNA sequences to encompass the full-length open-reading 
frame was also carried out by iterative searches of genomic databases. Three methods 
were used. The first method made use of the Smith-Waterman algorithm to carry out 
protein-protein searches of the closest homologue or orthologue to the partial kinase. The 
target databases consisted of Genescan and open-reading frame (ORF) predictions of all 
human genomic sequence derived from the human genome project (HGP) as well as from 
Celera. The complete set of genomic databases searched is shown in Table 7 below. 
Genomic sequences encoding potential extensions were further assessed by blastp analysis 
against the NCBI nonredundant database to confirm the novelty of the hit. The extending 
genomic sequences were incorporated into the cDNA sequence after removal of potential 
introns using the Seqman program from DNAStar. The default parameters used for 
Smith-Waterman searches were as follows. Matrix: blosum 62; gap-opening penalty: 12; gap 
extension penalty: 2. Genescan predictions were made using the Genescan program as 
detailed in Chris Burge and Sam Karlin, "Prediction of Complete Gene Structures in 
Human Genomic DNA", JMB (1997) 268(1):78-94. ORF predictions from genomic 
DNA were made using a standard 6-frame translation. 

The second method for genomic sequence-based extensions made use of tBlastn 
searches of the homologue or orthologue to the partial kinase against the cDNA databases 
listed in Table 7. The recognition of significant hits in these databases made it possible to 
identify bridging partial cDNA clones. The iterative application of the two methods made 
possible the assemblage of the virtual full-length sequence for a large number of the 
kinases presented in this application. All tblastn searches were conducted using a 
blosum62 matrix, a penalty for a nucleotide mismatch of -3 and reward for a nucleotide 
match of 1. 

The last method for defining cDNA extensions from genomic sequence used 
iterative searches of genomic databases through the Genescan program to predict exon 
splicing and the Genewise program (http://www.sanger.ac.uk/Software/Wise2/ ) to predict 
potential ORFs based on homology to the closest orthologue/homologue. 
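
The Smith-Waterman searches of the first method used a gap-opening penalty of 12 and a gap-extension penalty of 2. The sketch below shows how such affine gap penalties enter a local alignment score. It is a minimal illustration under stated assumptions: a toy identity score (5 for a match, -4 for a mismatch) stands in for the blosum 62 matrix, the first residue of a gap is charged the opening penalty and each further residue the extension penalty, and the function names are hypothetical. 

```python
def identity_score(x, y):
    """Toy substitution score standing in for a full BLOSUM62 lookup."""
    return 5 if x == y else -4


def smith_waterman_score(a, b, score=identity_score, gap_open=12, gap_extend=2):
    """Best local alignment score of a and b with affine gap penalties."""
    neg_inf = float("-inf")
    rows, cols = len(a) + 1, len(b) + 1
    H = [[0] * cols for _ in range(rows)]        # best score ending at (i, j)
    E = [[neg_inf] * cols for _ in range(rows)]  # alignment ends in a gap in a
    F = [[neg_inf] * cols for _ in range(rows)]  # alignment ends in a gap in b
    best = 0
    for i in range(1, rows):
        for j in range(1, cols):
            E[i][j] = max(H[i][j - 1] - gap_open, E[i][j - 1] - gap_extend)
            F[i][j] = max(H[i - 1][j] - gap_open, F[i - 1][j] - gap_extend)
            diagonal = H[i - 1][j - 1] + score(a[i - 1], b[j - 1])
            H[i][j] = max(0, diagonal, E[i][j], F[i][j])
            best = max(best, H[i][j])
    return best
```

Under these assumptions a single-residue gap costs 12 and a two-residue gap costs 14, so a gap is opened only when the matches gained on its far side outweigh that cost. 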

Table 7. Databases used for genomic-based sequence extensions 

Database            Number of entries   Database Date 
Celera v. 1-5       5,306,158           Jan 19/00 
Celera v. 6-10      4,209,980           Mar 24/00 
Celera v. 11-14     7,222,425           Apr 24/00 
Celera v. 15        243,044             May 14/00 
HGP all Genescan    25,885              Apr 04/00 
HGP; Phase 0        4,944               May 04/00 
HGP; Phase 1        28,478              May 05/00 
HGP; Phase 2        1,508               May 04/00 
HGP; Phase 3        9,971               May 05/00 


Virtual Extensions 

Human AA826850 (SEQ ID NO:3, SEQ ID NO:124) 

Blastn analysis of the partial AA826850 sequence revealed an extension to 
encompass the complete ORF in the Incyte EST 238299.1. A frame-shift correction at 
position 595 of this EST (marked by X in NA sequence) generated an uninterrupted ORF. 

Human AA960957 (SEQ ID NO:4, SEQ ID NO:125) 



Since the initial filing of this application, the partial AA960957 sequence appeared 
in the public database as the full-length gene for a protein kinase encoded by a gene that 
maps adjacent to the EVC (AJ250839) (Ellis-van Creveld syndrome and Weyers acrodental 
dysostosis) gene from 4p16.1. 

Human 5R79-46-1_h (SEQ ID NO:5, SEQ ID NO:126) 

Blastn analysis of the partial 5R79-46-1 sequence revealed an extension to 
encompass the complete ORF in the Incyte EST 463894.6. Since the initial filing of this 
application, the full-length virtual 5R79-46-1 appeared in the public database as the 
full-length gene for the TANK-binding kinase (TBK1) (Pomerantz, J.L. and Baltimore, D. 
(1999) EMBO J. 18 (23), 6694-6704). TBK1 participates in NF-kB activation through the 
formation of a signaling complex with TRAF2 and TANK. 

Human AA305176 (SEQ ID NO:6, SEQ ID NO:127) 

Blastn analysis of the partial AA305176 sequence revealed an extension to 
encompass the complete ORF in the Incyte EST 220937.1. 

Human AA256100 (SEQ ID NO:8, SEQ ID NO:129) 

Blastn analysis of the partial AA256100 sequence revealed an extension to 
encompass the complete ORF through the assembly of three partial clones: Incyte EST 
480815.6, KIAA0965 (BAA76809) and AA256100. 

Human AA210825 (SEQ ID NO:9, SEQ ID NO:130) 

Blastn analysis of the partial AA210825 sequence revealed an extension to 
encompass the nearly complete ORF through the assembly of three partial clones: Incyte 
EST 014721.7, and the NCBI ESTs AWOl 1 58 and AA210825. An insertion of two "N's" 
at positions 1915 and 1916 generated an uninterrupted ORF. Blastx analysis indicated the 
possibility of a start Met in the range of 400-450 nucleotides (i.e., compared to the closest 
homolog, human PKCmu (CAA53384.1)). However, no Met was found in this region; 
rather, the ORF ends in an in-frame stop preceded by the sequence 
"RGLLAPGDPPCPPPNPAPATPPSSRLPTELFSNFCDS". It is possible that part of the 
sequence covered by nucleotide positions 1-400 derived from AWOl 158 comes from an 
intron, explaining the absence of a start Met. 

Human AA127299 (SEQ ID NO:10, SEQ ID NO:131) 

No entries in the database extended this sequence. The 1684 bp insert of this EST 
contains a 1369 bp intron at the 3' end. Blastx and SW analysis of the 315 bp coding 
region revealed homology to the extracatalytic C2 domain of PKC. This EST may or may 
not encode a kinase. 

Human AA316804 (SEQ ID NO:11, SEQ ID NO:132) 

Since the initial filing of this application, the partial AA316804 sequence appeared 
in the public database as the full-length gene for the PKC family protein kinase EPK2 or 
PKCnu (AB015982). 

Human H19102 (SEQ ID NO:14, SEQ ID NO:135) 

Genewise and Genescan analyses of the partial H19102 sequence revealed an 
extension from the HGP phase 3 contig 3810672 to encompass the complete catalytic 
domain of this EST. Blastn analysis against the non-redundant database revealed that this 
gene is found in the cosmid AC005726 from chromosome 17. H19102 may encode a dual 
catalytic kinase given the homology to S6 kinase. Analysis of genomic sequence upstream 
of the 5' end of H19102 revealed a non-kinase gene oriented in the same polarity as 
H19102, suggestive of the start Met for H19102 being close to the 5' end of the H19102 
sequence. From this analysis it is deduced that the second catalytic domain of H19102, if 
present, is most likely located within the 47,334-185,215 bp region of the genomic 
sequence of AC005726. 

Human AA476563 (SEQ ID NO: 15, SEQ ID NO: 136) 

Since the initial filing of this application, the partial AA476563 sequence appeared 
in the public database as the full-length gene for the protein kinase RPS6KC1 
(NM_012424) (Zhang, H. et al., Genomics (1999) 61, 314-318), which is an S6 kinase 
mapping to 12q12-q13.1. 

Human AA626690 (SEQ ID NO:16, SEQ ID NO:137) 

Since the initial filing of this application, the partial AA626690 sequence appeared 
in the public database as the full-length gene for the protein kinase RPS6KA6 (AF184965) 
(Yntema, H.G. et al. (1999) Genomics 62, 332-343), an S6 kinase commonly deleted in 
patients with complex X-linked (Xq21.1) mental retardation. 

Human AI215680 (SEQ ID NO:17, SEQ ID NO:138) 

Since the initial filing of this application, the partial AI215680 sequence appeared 
in the public database as the full-length gene encoding a hypothetical protein (AAD30182) 
from the locus AC006530.4 from chromosome 14. 

Human AA887783 (SEQ ID NO:21, SEQ ID NO:142) 

Blastn analysis of the partial AA887783 sequence revealed an extension to 
encompass the nearly complete ORF through the assembly of three partial clones: Incyte 
415390R6 and the NCBI ESTs AA887783 and N94726. Since the initial filing of this 
application, the nearly full-length virtual AA887783 sequence appeared in the public 
database as the full-length gene encoding SGK3 (AF169035), a serum- and 
glucocorticoid-induced protein kinase (Kobayashi, T. et al. (1999) Biochemical J. 344, 
189-197). 

Human R47805 (SEQ ID NO:22, SEQ ID NO:143) 

A cDNA clone encoding the full-length ORF of R47805 was isolated using 
R47805 as a screening probe. A full-length form for R47805 has also appeared in the 
public database as PTK9L (NM_007284), an A6-related protein kinase. 

Human H60215 (SEQ ID NO:23, SEQ ID NO:144) 

Blastn analysis of the partial H60215 sequence revealed an extension to encompass 
the complete ORF in the public EST AI275726. This was confirmed through the full insert 
sequencing of this EST (2,310 bp), which corresponds to the sequence under SEQ ID 
NO:144. 

A different stop codon was predicted for AI275726 compared to H60215 due to a 
single nucleotide insertion at position 1586 in AI275726. Evidence for the extra nucleotide 
comes from EST AH 9 1 922. 

SGK324_h orthologue of W30246_m (SEQ ID NO:24, SEQ ID NO:145) 

Blastn, blastx and Smith-Waterman analyses of genomic databases revealed an 
extension to encompass the complete ORF corresponding to the human orthologue of 
murine W30246. Exons predicted from the following sequences were used for contig 
construction: Celera 17000189645083, 17000057549105 and 11000501939981; 
Incyte 142404.1, HGP_7249119, Incyte 7196489H1, Celera 11000501939981, 
17000028165594; Incyte 7249119_3, Celera 17000035772368, 11000502081575 and 
17000140274329. The latter Celera sequence provides the N-terminus. 

Human AA383293 (SEQ ID NO:26, SEQ ID NO:147) 

Blastn, blastx and Smith-Waterman analyses of genomic databases revealed an 
extension to encompass the complete ORF corresponding to AA383293. Exons predicted 
from the following sequences were used for contig construction (numbers in parentheses 
refer to the aa sequence of the closest homolog (RU2S, NP_057440) used for the 
Smith-Waterman query): N-term from Incyte 6010175_2 (14-97), Incyte 6981981 (134-184) 
7596749 (186-232) Celera 17000020789545 (243-301) CAB75619.1 (310-341)-(56-145 
DCX homology) 6010175_2, Celera 17000030058129 (241-262 DCX homology). 

Human AA021445 (SEQ ID NO:32, SEQ ID NO:152) 

Blastn analysis revealed an extension to encompass the nearly complete ORF 
corresponding to AA021445. Contig reconstruction was as follows: nucleotides 1-802 
from KIAA0999 (AB023216); nucleotides 803-4321 from full-insert sequence of 
AA021445. A pairwise alignment between AA021445 and KIAA0999 revealed three 
inserts in the extracatalytic C-terminus of 48, 48 and 161 amino acids. In addition, both 
AA021445 and KIAA0999 have 15 copies of a CAG repeat. Trinucleotide repeats are 
often found in genes that are linked to neurodegenerative diseases. 

Human 2R22-55-1 (SEQ ID NO:33, SEQ ID NO:153) 

Blastn analysis revealed an extension in the Incyte EST clone 321074.1 to 
encompass the complete ORF corresponding to 2R22-55-1. 

Human orthologue of AA544838_m (SEQ ID NO:36, SEQ ID NO:156) 

tBlastn analysis identified the partial human KIAA0135 (U79240) clone as the 
human orthologue of murine AA544838. Blastn revealed an extension of KIAA0135_h 
(U79240) to encompass the complete ORF. The full ORF was reconstructed from 
Incyte 406786.5, KFZp430051 and KIAA0135 (U79240). 

Human orthologue of AI785735_m (SEQ ID NO:38, SEQ ID NO:158) 

tBlastn analysis identified the partial human KIAA0781 (AB018324) clone as the 
human orthologue of murine AI785735. Blastn revealed an extension of KIAA0135_h 
(U79240) to encompass the complete ORF. The full ORF was reconstructed from Incyte 
986123.37 KIAA0781 (AB018324). 

Human AA207220 (SEQ ID NO:39, SEQ ID NO:159) 

Blastn analysis revealed an extension to encompass the nearly complete ORF 
corresponding to AA021445. The full ORF was reconstructed from Incyte 402740.1 and 
AA207220. Frame corrections: deletion of 441 and 595 over the Incyte 402740.1 sequence, 
based on blastx, to keep the frame open; two N insertions at 940 and 941 over AA207220 
to keep the frame open. 

Human AA426580 (SEQ ID NO:40, SEQ ID NO:160) 

Since the initial filing of this application, the partial AA426580 sequence appeared 
in the public database as the full-length gene encoding MAK-V (AJ271722) from 
chromosome 21q22.1. 

Human 5R79-54-1 (SEQ ID NO:41, SEQ ID NO:161) 

Genewise and Genescan analyses of the partial 5R79-54-1 sequence revealed an 
extension from genomic sequence to encode the full ORF for 5R79-54-1. 

Human orthologue of AA542015_m (SEQ ID NO:42, SEQ ID NO:162) 

tBlastn analysis identified KIAA1297 (AB037718). Blastn extended the 
KIAA1297 sequence to provide the C-terminus through the Incyte 224074.1 EST. The 
partial ORF consists of a dual catalytic domain flanked by 6 Ig domains and 2 fibronectin 
repeats. Based on homology to the bt Drosophila protein (AAF59316.1), the human form 
of AA542015 is expected to be missing 16 Ig domains. 

Human R19772 (SEQ ID NO:44, SEQ ID NO:164) 

The full-length ORF for R19772 was isolated by screening a cDNA library using a 
probe derived from R19772. Since the initial filing of this application, the R19772 
sequence appeared in the public database as the full-length gene encoding Trio (Duet) 
(AB011422). cDNA library screening revealed multiple isoforms for this gene, which are 
summarized in the Table below. 

Table 8. Isoforms for R19772 

Kestrl Name   Kestrl AA Acc#   Isoform type   Source            Description* 
Trio (Duet)   R19772           B              Skeletal muscle   Deletion of K at 124 
                                                                Deletion of Q at 616 
                                                                Substitution of E for G at 762 
                               C              Skeletal muscle   Deletion of K at 124 
                                                                Deletion of Q at 616 
                                                                Substitution of E for G at 762 
                                                                Deletion of 32 aa (160-191) 
                               D              Lung tumor        Deletion of Q at 616 
                                                                Deletion of 32 aa (160-191) 
                               E              Lung tumor        Deletion of Q at 616 
                                                                Deletion of 32 aa (160-191) 

* reference amino acid positions are with respect to the sequence of Trio (AB011422) 


Human AA435956 (SEQ ID NO:48, SEQ ID NO:168) 

Blastn analysis revealed an extension to encompass the nearly complete catalytic 
region of AA435956. 5' end sequence extension was provided by genomic locus 
AC007242.3_h (range 44880-43801). Based on blastx analysis, the extended sequence 
is full-length at the C-terminus. 

Human AA397553 (SEQ ID NO:51, SEQ ID NO:171) 

Since the initial filing of this application, the partial AA397553 sequence appeared 
in the public database as the full-length gene encoding CRK7 (AF227198), a novel 
CDC2-related protein kinase that colocalizes with interchromatin granule clusters. 

Human AA789239 (SEQ ID NO:52, SEQ ID NO:172) 

Since the initial filing of this application, the partial AA789239 sequence appeared 
in the public database as the full-length gene encoding NKIAMRE (AF130372), a novel 
kinase deleted in human leukemia. 

Human AA631990 (SEQ ID NO:55, SEQ ID NO:175) 

Blastn analysis revealed an extension to encompass the full-length ORF for 
AA631990. The full ORF was reconstructed from 253847.5 and AA631990 and 
AA207220. Frame corrections: delete 1 C at 1380, delete 2 N's at 2033/2034. 

Human AA557536 (SEQ ID NO:56, SEQ ID NO:176) 

Blastn analysis revealed an extension to encompass the full-length ORF for 
AA557536. The full ORF was reconstructed from AA557536, Celera 11000504061899 
and the Incyte 097089.1 EST. An 85 bp intron was removed from AA557536. 

Human N34132 (SEQ ID NO:63, SEQ ID NO:183) 

Full sequencing of EST N34132 (1.3 kb) confirmed that this cDNA encodes a 
novel NEK-subfamily kinase. Blast analysis against the EST database showed that four 
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EST sequences (AA283140, AA283140, AA28291 1 and N5301 1) extended the sequence 
of N34132 at the 3' end to form a 2.31 kb contig. Blast analysis of the new contig against 
the nonredunat public database showed that the N34132 extended contig overl^ped 
(100% identity) over 228 bp at its 3' end with human KIAA0344 (AB002342), a 5, 787 bp 
5 cDNA encoding a 1246 aa polypeptide. The 5' 790 bp of the KIAA0344 cDNA (encoding 
the 58 N-tenminal protein sequence) were foimd to be divergent with respect to the 
extended 2.32 kb N34132 contig. Evidence that the extended N34132 contig (2.31kb) and 
KIAA0344 (AB002342) belong to the same gene is the following. First, blast analysis of 
the nucleotide sequences for N34132 and KIAA0344 against the NRN database confirmed 

10 that these cDNA's are transcribed from the same genomic locus defined by two 

overlapping BACs (AC004765 and AC004803) from chromosome 12pl3.3. Second, full 
sequence determination of a PGR fragment amplified from single-stranded cDNA 
confirmed the jimction between the extended N34132 contig and KIAA0344_h 
(AB002342). The 462 PGR product was amplified with primers 

15 CTGGTGAACAGACAGTGCAG (5* primer) and GAGATTGTACTACTGGGTGTG (3' 
primer) designed firom the N34132 extended contig and KIAA0344 sequences, 
respectively. The region of N34132 containing the start Met was isolated by PGR from a 
testis cDNA library (Glontech). 

Human 5R69-17-2 (SEQ ID NO:67, SEQ ID NO:187)

The full-length ORF for 5R69-17-2 was isolated by screening a cDNA library
using a probe derived from 5R69-17-2.

Human H85811 (SEQ ID NO:68, SEQ ID NO:188)

Tblastn, Smith-Waterman and blastn analyses using cDNA databases revealed an
extension to encompass the full-length ORF for H85811. The full ORF was reconstructed
from Incyte ESTs 202971.8, 034583.3 and 034583.1 and public ESTs H85811 and
AI570599.

Human R43524 (SEQ ID NO:73, SEQ ID NO:192)

Blastn analysis revealed an extension to encompass the complete catalytic region
and the C-terminus of R43524. Since the initial filing of this application, the partial
R43524 sequence appeared in the public database as the full-length gene encoding the
heme-regulated initiation factor 2-alpha kinase (HRI) (AF181071).

Human AA088547 (SEQ ID NO:78, SEQ ID NO:197)

Genewise and Genescan analyses of genomic databases revealed an extension to
encompass the complete ORF for AA088547.

Human orthologue of AA139478_m (SEQ ID NO:80, SEQ ID NO:199)

Tblastn identified Incyte 211475.1 as the potential full-length human
orthologue of murine AA139478.

Human AA232253 (SEQ ID NO:82, SEQ ID NO:201)

The full-length ORF for AA232253 was isolated by screening a cDNA library
using a probe derived from AA232253. Since the initial filing of this application, the
AA232253 sequence appeared in the public database as the full-length gene encoding SLK
(AB011422). SLK is a stress-regulated mixed lineage kinase-like protein that is implicated
in activation of Rac and induction of apoptosis. cDNA library screening revealed multiple
isoforms for this gene, which are summarized in the Table below.

Table 9. Isoforms for AA232253

Kestrl Name   Kestrl AA Acc#   Isoform type   Description*
MLK4          AA232253         MLK4           Substitution of C for W at 346
                               MLK4B          Different C-term (332-800); seq in MLK4B is as shown in *

* C-terminus specific to MLK4B

LPLAARMSEESYFESKTEESNSAEMSCQITATSNGEGHGMNPSLQAMMLMGFGDI 
FSMNKAGAVMHSGMQINMQAKQNSS 
20 KTTSKIUlGKKVNMAIXjFSDFDi^EGDDDDDDDGEEED>roMDNSE 

Human H97685 (SEQ ID NO:84, SEQ ID NO:203)

Blastn analysis revealed an extension to encompass the full-length ORF for
H97685. The full ORF was reconstructed from Incyte 474824.1 and the public ESTs
H97685 and M62021.

Human AI052250 (SEQ ID NO:87, SEQ ID NO:206)

Blastn analysis revealed an extension to encompass the full-length ORF for
AI052250. The full ORF was reconstructed from Incyte 396868.1, the public partial
cDNA FLJ10074 (minus intron) and the public ESTs AI052250, H97685, AI499220
and M62021.

Human AA278842 (SEQ ID NO:88, SEQ ID NO:206)

A nearly full-length cDNA (FL4F12) for AA278842 was isolated by screening a
cDNA library using a probe derived from AA278842. A full-length virtual ORF was
generated using FL4F12 and AA278842.

Human AA599286 (SEQ ID NO:89, SEQ ID NO:208)

Since the initial filing of this application, the partial AA599286 sequence appeared
in the public database as a full-length ORF (AK000342).

Human AA425725 (SEQ ID NO:90, SEQ ID NO:209)

Since the initial filing of this application, the partial AA425725 sequence appeared
in the public database as MSSK1, a serine kinase gene located on human chromosome
Xq28.

Human SGK022 orthologue of AA060026_m (SEQ ID NO:91, SEQ ID NO:210)

Tblastn, Smith-Waterman and blastn analyses of cDNA and genomic databases
revealed a potential human orthologue for murine AA060026. The full-length
ORF for SGK022 was reconstructed from genomic locus AC022307.

Human AA399669 (SEQ ID NO:93, SEQ ID NO:212)

Blastn analysis revealed an extension to encompass the full-length ORF for
AA399669. The full ORF was reconstructed as follows: sequence 1-1007 from
AL136295.2; sequence 1008-2319 from AA399669 and Incyte 428177.1.

Human AA883975 (SEQ ID NO:95, SEQ ID NO:214)

Genescan and Genewise analyses of the genomic databases revealed an extension
for AA883975 to encompass the full-length ORF.

Human AA905446 (SEQ ID NO:96, SEQ ID NO:215)

Tblastn, Smith-Waterman and blastn analyses of cDNA and genomic databases
revealed an extension for AA905446 to encompass the full-length ORF. For the
Smith-Waterman analysis, murine STK22 (NP_033462) was used as the closest
orthologue. Contig formation: range 162133-163687 from HGP_h 6921333_9; removed
intron (146-893) predicted from blastx analysis.

Human H29974 (SEQ ID NO:97, SEQ ID NO:216)

Blastn analysis revealed an extension to encompass a complete catalytic ORF for
AA399669. The nearly full-length ORF was reconstructed using Incyte 213829.1 and
H29974.

Human AA215311 (SEQ ID NO:99, SEQ ID NO:218)

Blastn analysis revealed an extension to encompass the full-length ORF for
AA215311. The full ORF was reconstructed from Incyte 067584.1, 022456.1, AA215311
and the reverse complement of CPG_043208.

Human AA018361 (SEQ ID NO:100, SEQ ID NO:219)

The full-length ORF for AA018361 was isolated by screening a cDNA library
using a probe derived from AA018361. This yielded clone Sug4-30. Clone Sug4-30, like
multiple independent cDNA clones, contained a 181 bp intron. The existence of intron-less
RNAs was confirmed by a PCR reaction that generated a product that, upon sequence
analysis, skipped the intron region. The full-length virtual ORF for AA018361 was
generated through a contig between AL117482 (seq 1-367) and the sequence for clone
Sug4-30.

Human orthologue of AA396601_m (SEQ ID NO:106, SEQ ID NO:225)

Tblastn and Smith-Waterman analyses of genomic sequence revealed an extension
to encompass the full catalytic region for the human orthologue of AA396601. The ORF
was reconstructed from Incyte 018653.9 (7261449H1, 6891740J1) and genomic sequence
CPG_040010.

Human orthologue of AA671275_m (SEQ ID NO:108, SEQ ID NO:227)

Since the initial filing of this application, a potential human orthologue for murine
AA671275 appeared in the public database as the full-length ORF for vaccinia related
kinase 3 (BAA90769).

Human H05721 (SEQ ID NO:111, SEQ ID NO:230)

Genescan and Genewise analyses of genomic sequence revealed an extension to
encompass the full-length ORF for H05721.

Human AI086865 (SEQ ID NO:112, SEQ ID NO:231)

Genescan and Genewise analyses of genomic sequence revealed an extension to
encompass the full-length ORF for AI086865. The full-length ORF was reconstructed
from Celera 17000102901516, Incyte 243269.1 and public AL1377531.

Human AA836348 (SEQ ID NO:113, SEQ ID NO:232)

Genescan and Genewise analyses of genomic sequence revealed an extension to
encompass the full-length ORF for AA836348.

Human R86668 (SEQ ID NO:114, SEQ ID NO:233)

The full-length ORF for R86668 was isolated by screening a cDNA library using a
probe derived from R86668. Since the initial filing of this application, the R86668 sequence
appeared in the public database as the full-length gene mitogen-activated protein kinase
kinase kinase 6 (MAP3K6) (NM_00467).

Human 2R41-9-4 (SEQ ID NO:116, SEQ ID NO:235)

The full-length virtual ORF for 2R41-9-4 was generated using genomic sequence
to provide the N-terminus for the partial ORF predicted from clone 2R41-9-4.

Table 10. Sequences deleted from the provisional patent due to duplication with other
genes in the patent

Prov. SEQ ID NO: (na)   Prov. SEQ ID NO: (aa)
160                     196
213                     214
215                     216
122                     126
119                     123
148                     184
4                       20
7                       23
205                     206
14                      30
15                      31
35                      56
42                      63
51                      72
44                      65
77                      91
78                      92
79                      93
80                      94
157                     193



Results

Table 1 documents the results from the analysis of the nucleic acid sequence data.
From left to right the data presented are as follows. "Gene name" refers to the EST or PCR
fragment that defined the novel kinase. "Species" refers to the organism the sequence was
derived from. "ID#" refers to the nucleic acid and amino acid sequence ID number
designation from this patent. "Kinase family" and "Kinase group" refer to the protein
kinase classification defined by sequence homology and based on previously established
phylogenetic analysis [Hardie, G. and Hanks, S., The Protein Kinase Book, Academic Press
(1995); Hunter, T. and Plowman, G., Trends in Biochemical Sciences (1997) 22:18-22;
and Plowman, G.D. et al. (1999) Proc. Natl. Acad. Sci. 96:13603-13610]. "ORF Start",
"ORF End" and "ORF Length" refer to the open reading frame range and length as calculated
by standard nucleic acid translation programs such as MapDraw (DNAStar). "DNA
Repeats" refers to regions of low complexity sequence or repetitive elements such as Alu,
LINE, SINE, and LTR sequences. The chromosomal location (CHR localization) for 37
of the 110 novel protein kinases is shown in Table 1 (NA, not available). The methods
for determining chromosomal position are outlined below, in Example 2.

Table 2 documents the results from the analysis of the amino acid sequence data.
From left to right the data presented are as follows. "Gene name" refers to the EST or PCR
fragment that defined the novel kinase. "Species" refers to the organism the sequence was
derived from. "ID#" refers to the nucleic acid and amino acid sequence ID number
designation from this patent. "Kinase family" and "Kinase group" refer to the protein
kinase classification defined by sequence homology and based on previously established
phylogenetic analysis [Hardie, G. and Hanks, S., The Protein Kinase Book, Academic Press
(1995); Hunter, T. and Plowman, G., Trends in Biochemical Sciences (1997) 22:18-22;
and Plowman, G.D. et al. (1999) Proc. Natl. Acad. Sci. 96:13603-13610]. "nraa Score",
"ID match aa", "Identity", "Similar", "nraa Match Acc#" and "Description" refer to the data
obtained using a Smith-Waterman search of the amino acid sequence against the
non-redundant protein database (Matrix: Pam100; gap open/extension penalties 14/1). "Kinase
Domain Start", "Kinase Domain End", "Profile Start" and "Profile End" refer to data
obtained using a Hidden Markov Model to define catalytic range boundaries. The profile
has a length of 261 amino acids, corresponding to the complete protein kinase catalytic
domain. Proteins in which the profile recognizes a full-length catalytic domain have a
"Profile Start" of 1 and a "Profile End" of 261. The boundaries of the catalytic domain
within the overall protein are noted in the "Kinase Domain Start" and "Kinase Domain
End" columns.
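
The "Profile Start"/"Profile End" rule described above can be illustrated with a minimal sketch. The function names and the coverage calculation are hypothetical and are not part of the original analysis; only the profile length of 261 and the full-length criterion (start of 1, end of 261) are taken from the text.

```python
# Length of the catalytic-domain profile, as stated in the text.
PROFILE_LENGTH = 261


def is_full_length_catalytic_domain(profile_start, profile_end,
                                    profile_length=PROFILE_LENGTH):
    """True when the profile match spans the whole profile (1..profile_length)."""
    return profile_start == 1 and profile_end == profile_length


def profile_coverage(profile_start, profile_end, profile_length=PROFILE_LENGTH):
    """Fraction of the profile covered by the match (illustrative helper only)."""
    return (profile_end - profile_start + 1) / profile_length
```

For example, a hit with "Profile Start" 1 and "Profile End" 261 would be classed as a full-length catalytic domain, whereas a hit starting at profile position 30 would be classed as partial.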

The following abbreviations were used for kinases:

ASK     Apoptosis signal-regulating kinase
CaMK    Ca2+/calmodulin-dependent protein kinase
CCRK    Cell cycle-related kinase
CDK     Cyclin-dependent kinase
CK      Casein kinase
DAPK    Death-associated protein kinase
DM      myotonic dystrophy kinase
Dyrk    dual-specificity tyrosine phosphorylation-regulated kinase
GAK     Cyclin G-associated kinase
GRK     G-protein coupled receptor kinase
GuC     Guanylate cyclase
HIPK    Homeodomain-interacting protein kinase
IRAK    Interleukin-1 receptor-associated kinase
MAPK    Mitogen activated protein kinase
MAST    Microtubule-associated STK
MLCK    Myosin-light chain kinase
MLK     Mixed lineage kinase
NIMA    NimA-related protein kinase
PKA     cAMP-dependent protein kinase
RSK     Ribosomal protein S6 kinase
RTK     Receptor tyrosine kinase
SGK     Serum and glucocorticoid-regulated kinase
STK     serine threonine kinase
ULK     UNC-51-like kinase



The following abbreviations were used for species:

H       Human
M       Murine
R       Rat
FV      Fowlpox virus
MT      M. thermoautotrophicum
CE      Caenorhabditis elegans
DM      Drosophila melanogaster
OS      Oryza sativa
SP      Schizosaccharomyces pombe
TP      Tetrahymena pyriformis
PI      Petunia inflata
NC      Neurospora crassa
MSV     Medicago sativa
MSV     Moloney murine sarcoma virus
SA      Squalus acanthias
CS      Cucumis sativus
GM      Glycine max
LL      Lilium longiflorum
TV      Trichomonas vaginalis
MP      Mycoplasma pneumoniae
DD      Dictyostelium discoideum
SC      Saccharomyces cerevisiae
MT      Methanobacterium thermoautotrophicum

Domain and Motif Identification

A Hidden Markov model (HMM) (Krogh, A., Brown, M., Mian, I. S., Sjolander,
K., and Haussler, D. (1994). Hidden Markov models in computational biology:
Applications to protein modeling. J. Mol. Biol., 235:1501-1531) was used to identify
both catalytic and extracatalytic domains. Table 4 shows extra-catalytic domains that
were identified using the HMM program. Other domains, such as coiled-coil and PEST
motifs, were identified as described next.

Potential coiled-coil domains were identified using the COILS program
(www.ch.embnet.org/software/COILS_form.html). The matrix used was MTIDK with
windows of 14, 21 and 28 amino acids. Only regions scoring 0.5 or higher were considered
to be potential coiled-coil regions.
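
The 0.5 cutoff described above can be applied to per-residue output as in the following minimal sketch. It assumes the coiled-coil scores are already available as a list with one value per residue; the function name and input format are hypothetical, and the sketch does not run or reimplement the COILS program itself.

```python
def coiled_coil_regions(scores, threshold=0.5):
    """Return (start, end, length) for each run of residues scoring >= threshold.

    Positions are 1-based and inclusive, matching amino acid numbering.
    """
    regions = []
    start = None
    for position, score in enumerate(scores, start=1):
        if score >= threshold:
            if start is None:
                start = position          # a new region opens here
        elif start is not None:
            regions.append((start, position - 1, position - start))
            start = None                  # region closed by a low-scoring residue
    if start is not None:                 # region runs to the end of the sequence
        regions.append((start, len(scores), len(scores) - start + 1))
    return regions
```

Each returned tuple corresponds to one row of a region table: region boundaries and length in amino acids.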

Protein sequences containing potential PEST motifs were identified using the
program PESTfind (www.at.embnet.org/embnet/tools/bio/PESTfind/). PEST regions in
proteins are by definition sequences that tend to be rich in proline, glutamic or aspartic
acid, serine and threonine; they have been associated with increased protein turnover
rates (Rogers S. et al. (1986) Science 234, 364-368). The algorithm defines PEST
sequences as hydrophilic stretches of amino acids greater than or equal to 12 residues in
length. Such regions contain at least one P, one E or D and one S or T. They are flanked
by lysine (K), arginine (R) or histidine (H) residues, but positively charged residues are
disallowed within the PEST sequence. PESTfind produces a score ranging from about -50
to +50. By definition, a score above zero denotes a possible PEST region; a value greater
than +5 defines a high probability that there is a PEST domain.
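
The composition rules quoted above can be illustrated with a minimal sketch that lists candidate stretches only. This is not the PESTfind program: it does not evaluate hydrophilicity or compute the PESTfind score, and it assumes that a candidate must be flanked on both sides by K, R or H within the sequence. The function name is hypothetical.

```python
import re

# Candidate stretch: at least 12 residues with no K/R/H inside,
# immediately preceded and followed by K, R or H.
_CANDIDATE = re.compile(r"(?<=[KRH])[^KRH]{12,}(?=[KRH])")


def pest_candidates(sequence):
    """Return (start, end, segment) for stretches meeting the composition rules.

    Positions are 1-based and inclusive. A stretch qualifies when it contains
    at least one P, at least one E or D, and at least one S or T.
    """
    seq = sequence.upper()
    hits = []
    for match in _CANDIDATE.finditer(seq):
        segment = match.group(0)
        has_proline = "P" in segment
        has_acidic = "E" in segment or "D" in segment
        has_ser_thr = "S" in segment or "T" in segment
        if has_proline and has_acidic and has_ser_thr:
            hits.append((match.start() + 1, match.end(), segment))
    return hits
```

Candidates found this way would still need to be scored by PESTfind before the zero and +5 thresholds described above could be applied.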

Identification of potential coiled-coil domains and PEST domains in N34132

Potential coiled-coil domains were identified in N34132 (SEQ ID NO:183) using
the COILS program. Only regions scoring 0.5 or higher were considered to be potential
coiled-coil regions. The amino acid positions within N34132 scoring as potential
coiled-coil regions are shown below.

Table 11. Coiled-coil domains predicted for N34132

Coiled-coil Region   Amino acid range   Length (aa)
1                    124-147            24
2                    437-451            15
3                    495-526            32
4                    1,723-1,749        27



Potential PEST domains were identified in N34132 using PESTfind; a value
greater than +5 defines a high probability that there is a PEST domain. The amino acid
positions within N34132 scoring as potential PEST regions are shown below.

Table 12. Potential PEST domains identified in N34132

PEST Region   Score     Amino acid range   Amino Acid Length
1             +4.91     54-95              42
2             +11.4     537-570            34
3             +31.08    1293-1304          12
4             +10.15    1543-1565          23
5             +6.17     1698-1732          35



EXAMPLE 2. Chromosomal Localization of Novel Mammalian Protein Kinases

Materials and Methods

Several sources were used to find information about the chromosomal localization
of each of the genes described in this patent. First, the accession number for the nucleic
acid sequence was used to query the Unigene database. The site containing the Unigene
search engine is: http://www.ncbi.nlm.nih.gov/UniGene/Hs.Home.html. Information on
map position within the Unigene database is imported from several sources, including the
Online Mendelian Inheritance in Man (OMIM,
http://www.ncbi.nlm.nih.gov/Omim/searchomim.html), The Genome Database
(http://gdb.infobiogen.fr/gdb/simpleSearch.html), and the Whitehead Institute human
physical map (http://carbon.wi.mit.edu:8000/cgi-bin/contig/sts_info?database=release).
For example, searching Unigene with W56561, an EST for a MAK-like kinase, the
following information is retrieved: Chr.14, D14S65-qTEL. The location of this gene on
an "ideogram" of the cytogenetic map of chromosome 14 is also provided, showing that
W56561 maps to the bottom of chromosome 14, between 14q31 and 14qTel. If Unigene
has not mapped the EST, then the nucleic acid sequence for the gene of interest is used as
a query against databases, such as dbsts and htgs (described at
http://www.ncbi.nlm.nih.gov/BLAST/blast_databases.html), containing sequences that
have been mapped already. The nucleic acid sequence is searched using BLAST-2 at
NCBI (http://www.ncbi.nlm.nih.gov/cgi-bin/BLAST/nph-newblast) and is used to query
either dbsts or htgs. In addition to the Whitehead and GDB sites mentioned above,
Stanford University maintains a useful site for chromosomal mapping from STS data
(http://www-shgc.stanford.edu/RH/rhserverformnew.html). Matches in htgs are often
resolved immediately because the genomic region hit is annotated in the htgs entry. If an
exact match is found (defined roughly as 99% identity over a region of about 100
base pairs or longer, excluding any repetitive sequence), then the mapped position of the
entry in the database is assigned to the original kinase query. Once a cytogenetic region
has been identified by one of these approaches, disease association is established by
searching OMIM (see above for URL) with the cytogenetic location. OMIM maintains a
searchable catalog of cytogenetic map locations organized by disease. A thorough search
of available literature for the cytogenetic region is also made using Medline
(http://www.ncbi.nlm.nih.gov/PubMed/medline.html). References for association of the
mapped sites with chromosomal abnormalities found in human cancer can be found in:
Knuutila, et al., Am J Pathol, 1998, 152:1107-1123.
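
The "exact match" criterion used above to transfer a mapped position to a kinase query can be sketched as follows. This is a minimal illustration under stated assumptions: the hit dictionaries, their field names and the choice of the longest qualifying hit are hypothetical, and the aligned length is assumed to already exclude repetitive sequence. Only the approximate thresholds (99% identity, about 100 base pairs) come from the text.

```python
def is_exact_match(identity_percent, aligned_length_bp,
                   min_identity=99.0, min_length_bp=100):
    """Apply the rough criterion from the text: >= 99% identity over >= ~100 bp."""
    return identity_percent >= min_identity and aligned_length_bp >= min_length_bp


def assign_map_position(hits):
    """Return the map position of the longest qualifying hit, or None.

    Each hit is a dict with hypothetical keys 'identity' (percent),
    'length' (aligned bp, repeats excluded) and 'position' (mapped location).
    """
    qualifying = [h for h in hits if is_exact_match(h["identity"], h["length"])]
    if not qualifying:
        return None
    best = max(qualifying, key=lambda h: h["length"])
    return best["position"]
```

A query with no qualifying hit returns None, which corresponds to a kinase whose chromosomal location is reported as not available.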

Results

The chromosomal location for 37 of the 110 novel protein kinases is shown in
Table 1. Three of the novel protein kinases were mapped to regions associated with
cancer amplicons, as shown in this table. The regions were also cross-checked with the
Mendelian Inheritance in Man database, which tracks genetic information for many human
diseases, including cancer. References for association of the mapped sites with
chromosomal abnormalities found in human cancer can be found in: Knuutila, et al., Am J
Pathol, 1998, 152:1107-1123. Association of these mapped regions with other diseases is
documented in the Online Mendelian Inheritance in Man (OMIM)
(http://www.ncbi.nlm.nih.gov/htbin-post/Omim).

EXAMPLE 3. Generation of Specific Immunoreagents

Materials and Methods

Peptide sequences corresponding to extra-catalytic regions of the novel kinases are
chosen which are not homologous to other known kinases, based on a Smith-Waterman
homology search against the non-redundant protein database, and which are predicted to be
antigenic based on the DNAStar Protean program. These peptides are conjugated to KLH
using glutaraldehyde.

Rabbits are immunized with the KLH-peptide conjugates by four injections three
weeks apart. The rabbits are bled ten and fourteen days following the third injection and
bled out ten days after the fourth. The serum is checked against the peptide by ELISA.



Table 13. Peptides to be used as immunogens for raising antibodies

Clone Name   SEQ ID NO (aa)   Peptide Sequence    Amino Location
AA826850     124              KSRDNSRDSSQSEND     339-353
                              TEKLKRSQDLPREPLP    372-386
                              RGWRPYDfflS         223-232
5R79-46-1    126              FEGPRRNKEVMYK       224-236
                              KDDYNETVHKKTE       451-463
                              GTHPKDRNVEKLQ       541-553
                              EVSKYQEYTNELQET     643-657
AA256100     129              IDDTSNFDDFPESDI     405-419
                              TEPDYKSKDWVFL       427-439
                              EEKKLRRSQHARKET     61-75
AA210825     130              SNKDTLRKRHYWRLD     507-521
                              RHTTRKSSTTLRE       488-500
                              FQNNTTNRYYKEIPL     528-542
                              GKHRKTGRDVAVK       668-680
                              FPTKQESQLRNE        687-698
AA316804     132              ESHVHQEPSKRIPS      239-252
                              HTJCRKSSTMVKEGW     409-422
                              PSDLDVERDEEAVK      375-388
                              SPGQGKDHKDLSTSI     543-557
R47805       143              EPVGRWDQDYDRAVL     44-58
                              KPKGPGGKRGHKRLI     325-339
                              PTDVAQLPSRVPRDA     219-233
AA234451     167              DPFDWEKTGNDGSLT     293-307
                              HPRPQEKDVWEE        374-385
                              RENTDEVFPDEQLSD     340-354
                              RSEITQPDRDIPLVR     427-441
AA460132     180              LKSYSTSSKKARPVL     222-236
                              KKLDEVRLRGRKRSM     237-251
                              ETEKTAQGLSNLAKT     131-145
N34132       183              SGRRRRFIXSKGSKS     1848-1862
                              PGTAPSKPPLTKAPV     1474-1488
                              VDSDTQPKAPGIDD      1365-1378
                              AHSLDKTSHSSTTGL     1253-1267
5R69-17-2    187              GTTREKTDRVKST       178-190
                              HSEAPELHGKIRSSN     138-152
                              DETVTPPQFSIV        87-98
                              QYDVKSEIYS          204-213
AA278842     206              TVDPEKSVRDQAFKA     515-529
                              DSSTADRWDDEDWGS     637-651
                              SVSEDPTQLEEVEKD     539-553
AA836348     232              NAPTKRPRSSTVTEA     323-337
                              LDSEEDYYTPQKVDV     514-528
                              GDKASYRQPKHVEKL     409-423

EXAMPLE 4. Expression analysis of Novel Mammalian Protein Kinases
GENE EXPRESSION ANALYSIS

Tissue Arrays

"cDNA libraries" derived from a variety of sources were immobilized onto nylon
membranes and probed with 32P-labeled cDNA fragments derived from the gene(s) of
interest.

Total RNA or mRNA was used as template in a reverse transcription reaction to
generate single-stranded cDNAs (ss cDNA) that were tagged with specific sequences at
each end. An oligo dT primer containing a specific sequence (CDS:
AAGCAGTGGTAACAACGCAGAGTACT(30)VN (V=A,G,C; N=A,G,C,T)) anneals at
the polyA tract at the 3' end of the mRNA, and the reverse transcriptase (MMLV
RNaseH-) transcribes the antisense strand until it reaches the end of the RNA strand, where
it adds additional C residues. If a primer (SMII:
AAGCAGTGGTAACAACGCAGAGTACGCGGG or ML2G:
AAGTGGCAACAGAGATAACGCGTACGCGGG) ending with 3 Gs is added, it anneals
to the added Cs, and the MMLV reverse transcriptase recognizes the rest of the primer
sequence as template and continues transcription. As a result, the synthesized cDNAs
contain specific sequence tags at both the 5' and the 3' end. When the 5' and the 3' ends
are tagged with the same sequence (CDS and SMII), the library is referred to as "symmetric."
When the 5' end is tagged with a different sequence than the 3' end (CDS and ML2G), it is
referred to as "asymmetric." A double-stranded "cDNA library" is then generated by PCR
amplification using the 3' PCR and ML2 primers (3' PCR: AAGCAGTGGTAACAACGCAGAGT
and ML2: AAGTGGCAACAGAGATAACGCGT) that anneal to the added sequence tags.

The amplified "cDNA libraries" were manually arrayed onto nylon membranes
with a 384 pin replicator. The DNA was denatured by alkali treatment, neutralized and
cross-linked by UV light. The arrays were pre-hybridized with Express Hyb (Clontech)
and hybridized with 32P-labeled probes generated by random hexamer priming of cDNA
fragments corresponding to the genes of interest. After washing, the blots were exposed to
phosphorimaging cassettes and the intensity of the signal was quantified. The amount of
DNA on the arrays was also quantified by treating non-denatured or denatured arrays
with SYBR Green I or SYBR Green II, respectively (1:100,000 in 50 mM Tris, pH 8.0), for 2
minutes. After washing with 50 mM Tris, pH 8.0, the fluorescent emission was detected
with a phosphorimager (Molecular Dynamics) and quantified. The amount of the arrayed
DNA was used to normalize the hybridization signal, and the corrected values are tabulated
in Table 3.
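
The normalization step described above can be sketched as follows. The text states only that the amount of arrayed DNA was used to normalize the hybridization signal; the simple ratio used here, the function name and the dictionary inputs are assumptions made for illustration and may differ from the calculation actually used to produce Table 3.

```python
def normalize_signals(hybridization, dna_amount):
    """Divide each spot's hybridization signal by its arrayed-DNA signal.

    Both arguments are dicts keyed by spot identifier (hypothetical format).
    Spots with a missing or zero DNA measurement are reported as None,
    because a ratio cannot be computed for them.
    """
    normalized = {}
    for spot, signal in hybridization.items():
        amount = dna_amount.get(spot)
        if not amount:
            normalized[spot] = None
        else:
            normalized[spot] = signal / amount
    return normalized
```

Under this assumption, a spot with twice the arrayed DNA and twice the raw signal of another spot would receive the same corrected value.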

Results

The results of the microarray expression analysis of the protein kinases presented
in this application are shown in Table 3. Data presentation from left to right is as follows:
"Tissue": tissue type of the cDNA; "Tumor sym" indicates that the tissue is derived from
a tumor, where "sym" refers to the fact that the 5' and 3' primers used to make the sample
are the same; "Normal sym" indicates that normal tissue was used to make the sample, with
symmetric primers as described above; "Tumor 1°" indicates that primary tumor tissue
was used to make the cDNA; "Tumor cells" indicates that these cDNA samples were
made from cultured tumor cells; "Normal" indicates that these samples are derived from
normal tissue or cell lines; "Endos" indicates that these samples are derived from
endothelium-related tissue sources; "p53" refers to the status, mutant or wild-type, of the
p53 gene in the source samples. Normalized expression values are presented for each
gene, referred to by its SEQ ID#, in the subsequent columns. Genes represented in
expression Table 3 are: SEQ ID NO:3 (AA826850), SEQ ID NO:5 (TBK1), SEQ ID NO:6
(AA305176), SEQ ID NO:8 (AA256100), SEQ ID NO:9 (CAB43292), SEQ ID NO:11
(EPK2), SEQ ID NO:12 (PKNbeta), SEQ ID NO:14 (H19102), SEQ ID NO:16 (RSK4),
SEQ ID NO:17 (AAD30182), SEQ ID NO:20 (SGK2), SEQ ID NO:22 (PTK9L), SEQ ID
NO:26 (AA383293), SEQ ID NO:29 (DRAK2), SEQ ID NO:31 (DRAK1), SEQ ID
NO:032 (AA015726), SEQ ID NO:40 (MAK-V), SEQ ID NO:044 (TRAD), SEQ ID
NO:044 (TRAD), SEQ ID NO:45 (AA454060), SEQ ID NO:47 (AA234451), SEQ ID
NO:48 (AA436054), SEQ ID NO:49 (AA626859), SEQ ID NO:51 (KIAA0904), SEQ ID
NO:52 (AA789239), SEQ ID NO:54 (CCRK), SEQ ID NO:55 (CLK4), SEQ ID NO:56
(AA557536), SEQ ID NO:57 (W56561), SEQ ID NO:60 (AA579641), SEQ ID NO:63
(NEK7), SEQ ID NO:66 (CAMKKB), SEQ ID NO:68 (HIPK2), SEQ ID NO:72
(R19609), SEQ ID NO:73 (HRI), SEQ ID NO:78 (AA088547), SEQ ID NO:79
(AA449542), SEQ ID NO:082a (MLK4), SEQ ID NO:82 (MLK4b), SEQ ID NO:84
(RIP4), SEQ ID NO:88 (AA278842), SEQ ID NO:89 (AA195964), SEQ ID NO:90
(MSSK1), SEQ ID NO:93 (TSK4), SEQ ID NO:94 (AI025291), SEQ ID NO:95
(AA948538), SEQ ID NO:96 (AA905446), SEQ ID NO:97 (H85389), SEQ ID NO:100
(AA018361), SEQ ID NO:101 (AA311714), SEQ ID NO:110 (AA452647), SEQ ID
NO:111 (AA310219), SEQ ID NO:112 (AI086865), SEQ ID NO:114 (MEKK6), and SEQ
ID NO:116 (SuRTK106).


EXAMPLE 5. Kinase assays for Erk, JNK1 and p38 MAP kinases

293T cells were transiently transfected with HA-p38 or co-transfected with
Flag-tagged wild-type MLK4A, kinase-dead MLK4A, wild-type MLK4B or kinase-dead MLK4B
using Lipofectamine 2000 (Lifetech). Cells were lysed 36 hr post-transfection. Cell
lysates normalized to contain equivalent amounts of HA-p38 were immunoprecipitated
with anti-HA antibody (Mab HA.11, Babco). Immunoprecipitates were split into two
portions; one portion was Western-blotted with anti-HA antibody and the other with a
phospho-specific p38 antibody (Promega) to detect activated levels of p38. Activation of
Erk1 and Jnk1 was measured similarly. (This example applies to AA232253 (SEQ ID
NO:82, SEQ ID NO:201).)

Results:

In transient assays, wild-type MLK4A and MLK4B (but not kinase-inactive
MLK4A(K45M) or MLK4B(K45M)) activate Erk, JNK1 and p38 MAP kinases.

EXAMPLE 6. RAC1 guanine-exchange factor assay

293T cells were transiently transfected with HA-Rac1 or co-transfected with
Flag-tagged Duet C, Duet E, Dbl and HA-Tiam-1. Cells were lysed 36 hours post-transfection.
Cell lysates normalized to contain equivalent amounts of Rac1 were affinity precipitated
with immobilized GST-PBD (p21-binding domain of Pak3). Bound proteins were
Western blotted and probed with anti-HA antibody to detect levels of activated Rac1.
(This example applies to R199772 (Trad/Duet) (SEQ ID NO:44, SEQ ID NO:164).)

Results:

Duet C and Duet E both act as guanine nucleotide exchange factors on Rac1.

CONCLUSION

One skilled in the art would readily appreciate that the present invention is well 
adapted to carry out the objects and obtain the ends and advantages mentioned, as well as 
5 those inherent therein. The molecular complexes and the methods, procedures, treatments, 
molecules, specific compounds described herein are presently representative of preferred 
embodiments are exemplary and are not intended as limitations on the scope of the 
invention. Changes therein and other uses will occur to those skilled in the art which are 
encompassed within the spirit of the invention are defined by the scope of the claims. 

10 It will be readily apparent to one skilled in the art that varying substitutions and 

modifications may be made to the invention disclosed herein without departing from the 
scope and spirit of the invention. 

All patents and publications mentioned in the specification are indicative of the* 
levels of those skilled in the art to which the invention pertains. 

15 The invention illustratively described herein suitably may be practiced in the 

absence of any element or elements, limitation or limitations which is not specifically 
disclosed herein. Thus, for example, in each instance herein any of the terms 
"comprising", "consisting essentially of and "consisting of may be replaced with either 
of the other two terais. The terms and expressions which have been employed are used as 

20 terms of description and not of limitation, and there is no intention that in the use of such 
terms and expressions of excluding any equivalents of the features shown and described or 
portions thereof, but it is recognized that various modifications are possible within the 
scope of the invention claimed. 

In particular, although some formulations described herein have been identified by
the excipients added to the formulations, the invention is meant to also cover the final
formulation formed by the combination of these excipients. Specifically, the invention
includes formulations in which one to all of the added excipients undergo a reaction
during formulation and are no longer present in the final formulation, or are present in
modified forms.

In addition, where features or aspects of the invention are described in terms of
Markush groups, those skilled in the art will recognize that the invention is also thereby
described in terms of any individual member or subgroup of members of the Markush
group. For example, if X is described as selected from the group consisting of bromine,
chlorine, and iodine, claims for X being bromine and claims for X being bromine and
chlorine are fully described.

Other embodiments are within the following claims.

What is claimed is:

CLAIMS 

1. An isolated, enriched, or purified nucleic acid molecule encoding a kinase
polypeptide selected from the group consisting of
SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126,
SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131,
SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136,
SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141,
SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146,
SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151,
SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156,
SEQ ID NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161,
SEQ ID NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166,
SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171,
SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176,
SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181,
SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186,
SEQ ID NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191,
SEQ ID NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196,
SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201,
SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206,
SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211,
SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216,
SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221,
SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226,
SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231,
SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236,
SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241,
and SEQ ID NO:242.

2. The nucleic acid molecule of claim 1, wherein said nucleic acid molecule
comprises a nucleotide sequence that:

(a) encodes a polypeptide comprising the amino acid sequence set forth in
SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126,
SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131,
SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136,
SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141,
SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146,
SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151,
SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156,
SEQ ID NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161,
SEQ ID NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166,
SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171,
SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176,
SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181,
SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186,
SEQ ID NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191,
SEQ ID NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196,
SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201,
SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206,
SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211,
SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216,
SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221,
SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226,
SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231,
SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236,
SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241,
and SEQ ID NO:242;

(b) is the complement of the nucleotide sequence of (a);

(c) hybridizes under highly stringent conditions to the nucleotide
molecule of (a) and encodes a naturally occurring kinase polypeptide;

(d) encodes a kinase polypeptide having an amino acid sequence
selected from the group consisting of
SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126,
SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131,
SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136,
SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141,
SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146,
SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151,
SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156,
SEQ ID NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161,
SEQ ID NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166,
SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171,
SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176,
SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181,
SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186,
SEQ ID NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191,
SEQ ID NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196,
SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201,
SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206,
SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211,
SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216,
SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221,
SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226,
SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231,
SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236,
SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241,
and SEQ ID NO:242, except that it lacks one or more,
but not all, of a domain selected from the group consisting of an N-terminal domain, a
catalytic domain, a C-terminal domain, a coiled-coil structure region, a proline-rich region,
a spacer region, an insert, and a C-terminal tail;

(e) is the complement of the nucleotide sequence of (d);

(f) encodes a domain of an amino acid sequence selected from the
group set forth in
SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126,
SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131,
SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136,
SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141,
SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146,
SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151,
SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156,
SEQ ID NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161,
SEQ ID NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166,
SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171,
SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176,
SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181,
SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186,
SEQ ID NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191,
SEQ ID NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196,
SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201,
SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206,
SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211,
SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216,
SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221,
SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226,
SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231,
SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236,
SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241,
and SEQ ID NO:242, wherein said domain is selected from the group
consisting of an N-terminal domain, a catalytic domain, a C-terminal domain, a coiled-coil
structure region, a proline-rich region, a spacer region, an insert, and a C-terminal tail;
(g) is the complement of the nucleotide sequence of (f); 

(h) encodes a polypeptide comprising an amino acid sequence selected
from the group consisting of
SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126,
SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131,
SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136,
SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141,
SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146,
SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151,
SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156,
SEQ ID NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161,
SEQ ID NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166,
SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171,
SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176,
SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181,
SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186,
SEQ ID NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191,
SEQ ID NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196,
SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201,
SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206,
SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211,
SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216,
SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221,
SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226,
SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231,
SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236,
SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241,
and SEQ ID NO:242, except that it lacks one or more, but not
all, of the domains selected from the group consisting of an N-terminal domain, a catalytic
domain, a C-terminal domain, a spacer region, a proline-rich region, a coiled-coil structure
region, and a C-terminal tail; or

(i) is the complement of the nucleotide sequence of (h).

3. The nucleic acid molecule of claim 1, further comprising a vector or
promoter effective to initiate transcription in a host cell.

4. The nucleic acid molecule of claim 1, wherein said nucleic acid molecule is
isolated, enriched, or purified from a mammal.

5. The nucleic acid molecule of claim 4, wherein said mammal is a human. 

6. A nucleic acid probe for the detection of nucleic acid encoding a kinase
polypeptide in a sample, wherein said polypeptide is selected from the group consisting of
SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126,
SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131,
SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136,
SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141,
SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146,
SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151,
SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156,
SEQ ID NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161,
SEQ ID NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166,
SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171,
SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176,
SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181,
SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186,
SEQ ID NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191,
SEQ ID NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196,
SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201,
SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206,
SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211,
SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216,
SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221,
SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226,
SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231,
SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236,
SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241,
and SEQ ID NO:242.

7. The probe of claim 6, wherein said polypeptide is a fragment of the protein
encoded by an amino acid sequence selected from the group consisting of
SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126,
SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131,
SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136,
SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141,
SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146,
SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151,
SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156,
SEQ ID NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161,
SEQ ID NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166,
SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171,
SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176,
SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181,
SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186,
SEQ ID NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191,
SEQ ID NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196,
SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201,
SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206,
SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211,
SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216,
SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221,
SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226,
SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231,
SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236,
SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241,
and SEQ ID NO:242.

8. A recombinant cell comprising a nucleic acid molecule encoding a kinase
polypeptide selected from the group consisting of
SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126,
SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131,
SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136,
SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141,
SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146,
SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151,
SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156,
SEQ ID NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161,
SEQ ID NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166,
SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171,
SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176,
SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181,
SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186,
SEQ ID NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191,
SEQ ID NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196,
SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201,
SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206,
SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211,
SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216,
SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221,
SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226,
SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231,
SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236,
SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241,
and SEQ ID NO:242.

9. The cell of claim 8, wherein said polypeptide is a fragment of a protein
encoded by an amino acid sequence selected from the group consisting of
SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126,
SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131,
SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136,
SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141,
SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146,
SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151,
SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156,
SEQ ID NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161,
SEQ ID NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166,
SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171,
SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176,
SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181,
SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186,
SEQ ID NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191,
SEQ ID NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196,
SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201,
SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206,
SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211,
SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216,
SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221,
SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226,
SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231,
SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236,
SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241,
and SEQ ID NO:242.

10. An isolated, enriched, or purified kinase polypeptide selected from the
group consisting of
SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126,
SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131,
SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136,
SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141,
SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146,
SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151,
SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156,
SEQ ID NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161,
SEQ ID NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166,
SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171,
SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176,
SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181,
SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186,
SEQ ID NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191,
SEQ ID NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196,
SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201,
SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206,
SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211,
SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216,
SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221,
SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226,
SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231,
SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236,
SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241,
and SEQ ID NO:242.

11. The polypeptide of claim 10, wherein said polypeptide is a fragment of the
protein encoded by an amino acid sequence selected from the group consisting of
SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126,
SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131,
SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136,
SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141,
SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146,
SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151,
SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156,
SEQ ID NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161,
SEQ ID NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166,
SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171,
SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176,
SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181,
SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186,
SEQ ID NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191,
SEQ ID NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196,
SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201,
SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206,
SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211,
SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216,
SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221,
SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226,
SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231,
SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236,
SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241,
and SEQ ID NO:242.

12. The polypeptide of claim 10, wherein said polypeptide comprises:

(a) an amino acid sequence selected from the group consisting of
SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126,
SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131,
SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136,
SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141,
SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146,
SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151,
SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156,
SEQ ID NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161,
SEQ ID NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166,
SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171,
SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176,
SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181,
SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186,
SEQ ID NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191,
SEQ ID NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196,
SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201,
SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206,
SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211,
SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216,
SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221,
SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226,
SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231,
SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236,
SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241,
and SEQ ID NO:242;

(b) an amino acid sequence selected from the group consisting of
SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126,
SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131,
SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136,
SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141,
SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146,
SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151,
SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156,
SEQ ID NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161,
SEQ ID NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166,
SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171,
SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176,
SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181,
SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186,
SEQ ID NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191,
SEQ ID NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196,
SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201,
SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206,
SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211,
SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216,
SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221,
SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226,
SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231,
SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236,
SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241,
and SEQ ID NO:242, except that it lacks one or more, but not all, of the domains selected from
the group consisting of an N-terminal domain, a catalytic domain, a C-terminal domain, a
spacer region, a proline-rich region, a coiled-coil structure region, and a C-terminal tail; or
(c) a domain of an amino acid sequence selected from the group set
forth in
SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126,
SEQ ID NO:127, SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131,
SEQ ID NO:132, SEQ ID NO:133, SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136,
SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139, SEQ ID NO:140, SEQ ID NO:141,
SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145, SEQ ID NO:146,
SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151,
SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156,
SEQ ID NO:157, SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161,
SEQ ID NO:162, SEQ ID NO:163, SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166,
SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169, SEQ ID NO:170, SEQ ID NO:171,
SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175, SEQ ID NO:176,
SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181,
SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186,
SEQ ID NO:187, SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191,
SEQ ID NO:199, SEQ ID NO:193, SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196,
SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199, SEQ ID NO:200, SEQ ID NO:201,
SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205, SEQ ID NO:206,
SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211,
SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216,
SEQ ID NO:217, SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221,
SEQ ID NO:222, SEQ ID NO:223, SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226,
SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229, SEQ ID NO:230, SEQ ID NO:231,
SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235, SEQ ID NO:236,
SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241,
and SEQ ID NO:242, wherein said domain is selected from the group consisting
of a C-terminal domain, a catalytic domain, an N-terminal domain, a spacer region, a
proline-rich region, a coiled-coil structure region, and a C-terminal tail.

13. The kinase polypeptide of claim 10, wherein said polypeptide is isolated,
purified, or enriched from a mammal.

14. The kinase polypeptide of claim 13, wherein said mammal is a human. 

15. The kinase polypeptide of claim 10, wherein said polypeptide is a
AA144574, AA116841, AA256100, AA305176, AA210825, AA316804, AA980090,
N42050, AA476563, AA626690, AA960957, H19102, AA045601, AA107515,
AA109508 or AA887783 polypeptide.

16. The kinase polypeptide of claim 10, wherein said polypeptide is a H60215, 
AA197883, AA297313, W30246, AA172300, AA383293, AA542015, H01248, N23936, 
W44160, 2R22-5-1 i, 5R72-18-1, AA021445, AA207220, AA426580, AA544838,
W90839, 5R79-54-1, AA839940, R19772 or 5R72-8-2 polypeptide. 

17. The kinase polypeptide of claim 10, wherein said polypeptide is a
AA234451 polypeptide.

18. The kinase polypeptide of claim 10, wherein said polypeptide is a 5R65-16-1,
AA061797, AA065538, AA124976, AA397553, AA435956, AA575635, AA626859,
AA789239, AI086865, H17727, H29974, AA557536 or N28606 polypeptide.

19. The kinase polypeptide of claim 10, wherein said polypeptide is a
AA631990 or W08549 polypeptide.

20. The kinase polypeptide of claim 10, wherein said polypeptide is a 5R72-16-2,
R19927 or R43524 polypeptide.

21. The kinase polypeptide of claim 10, wherein said polypeptide is a 5R57-10-2
polypeptide.

22. The kinase polypeptide of claim 10, wherein said polypeptide is a
AA232253 polypeptide.

23. The kinase polypeptide of claim 10, wherein said polypeptide is a
AA430250, AA836348, R86668 or N34132 polypeptide.

24. The kinase polypeptide of claim 10, wherein said polypeptide is a
AA098024 or SuRTK106 polypeptide.

25. The kinase polypeptide of claim 10, wherein said polypeptide is a R47805, 
AA099102, AA589241, H85811, AA013524, AA452647, AA840598, AA088547, 
AA139478, AA826850, R87679, W65887, H97685, W2G810, AA599286, AA425725, 
AA103218, AA711829, AA060026, AA399669, AA758539, AA883975, AA948538,
AA018361, AA215311, AA311714, AA498104, 5R69-17-2, 5R69-23-3, 5R69-26-2,
AA118352, AA396601, AA671275, AA278842, AA460132 or H05721 polypeptide.




26. An antibody or antibody fragment having specific binding affinity to a
kinase polypeptide selected from the group consisting of
SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ ID NO:127,
SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131, SEQ ID NO:132, SEQ ID NO:133,
SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136, SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139,
SEQ ID NO:140, SEQ ID NO:141, SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145,
SEQ ID NO:146, SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151,
SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156, SEQ ID NO:157,
SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161, SEQ ID NO:162, SEQ ID NO:163,
SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166, SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169,
SEQ ID NO:170, SEQ ID NO:171, SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175,
SEQ ID NO:176, SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181,
SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186, SEQ ID NO:187,
SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191, SEQ ID NO:199, SEQ ID NO:193,
SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196, SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199,
SEQ ID NO:200, SEQ ID NO:201, SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205,
SEQ ID NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211,
SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID NO:217,
SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID NO:223,
SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229,
SEQ ID NO:230, SEQ ID NO:231, SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235,
SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241,
and SEQ ID NO:242.

27. The antibody or antibody fragment of claim 26, wherein said polypeptide 
comprises: 

(a) an amino acid sequence selected from the group consisting of
SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ ID NO:127,
SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131, SEQ ID NO:132, SEQ ID NO:133,
SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136, SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139,
SEQ ID NO:140, SEQ ID NO:141, SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145,
SEQ ID NO:146, SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151,
SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156, SEQ ID NO:157,
SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161, SEQ ID NO:162, SEQ ID NO:163,
SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166, SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169,
SEQ ID NO:170, SEQ ID NO:171, SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175,
SEQ ID NO:176, SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181,
SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186, SEQ ID NO:187,
SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191, SEQ ID NO:199, SEQ ID NO:193,
SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196, SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199,
SEQ ID NO:200, SEQ ID NO:201, SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205,
SEQ ID NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211,
SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID NO:217,
SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID NO:223,
SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229,
SEQ ID NO:230, SEQ ID NO:231, SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235,
SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241,
and SEQ ID NO:242;

(b) an amino acid sequence selected from the group consisting of
SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ ID NO:127,
SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131, SEQ ID NO:132, SEQ ID NO:133,
SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136, SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139,
SEQ ID NO:140, SEQ ID NO:141, SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145,
SEQ ID NO:146, SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151,
SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156, SEQ ID NO:157,
SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161, SEQ ID NO:162, SEQ ID NO:163,
SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166, SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169,
SEQ ID NO:170, SEQ ID NO:171, SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175,
SEQ ID NO:176, SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181,
SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186, SEQ ID NO:187,
SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191, SEQ ID NO:199, SEQ ID NO:193,
SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196, SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199,
SEQ ID NO:200, SEQ ID NO:201, SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205,
SEQ ID NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211,
SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID NO:217,
SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID NO:223,
SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229,
SEQ ID NO:230, SEQ ID NO:231, SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235,
SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241,
and SEQ ID NO:242, except that it lacks one or more, but not all, of the domains selected from
the group consisting of a C-terminal domain, a catalytic domain, an N-terminal domain, a
spacer region, a proline-rich region, a coiled-coil structure region, and a C-terminal tail.

(c) a domain of an amino acid sequence selected from the group set forth in
SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ ID NO:127,
SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131, SEQ ID NO:132, SEQ ID NO:133,
SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136, SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139,
SEQ ID NO:140, SEQ ID NO:141, SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145,
SEQ ID NO:146, SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151,
SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156, SEQ ID NO:157,
SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161, SEQ ID NO:162, SEQ ID NO:163,
SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166, SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169,
SEQ ID NO:170, SEQ ID NO:171, SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175,
SEQ ID NO:176, SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181,
SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186, SEQ ID NO:187,
SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191, SEQ ID NO:199, SEQ ID NO:193,
SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196, SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199,
SEQ ID NO:200, SEQ ID NO:201, SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205,
SEQ ID NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211,
SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID NO:217,
SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID NO:223,
SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229,
SEQ ID NO:230, SEQ ID NO:231, SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235,
SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241,
and SEQ ID NO:242 wherein said domain is selected from the group consisting
of a C-terminal domain, a catalytic domain, an N-terminal domain, a spacer region, a
proline-rich region, a coiled-coil structure region, and a C-terminal tail.

28. A hybridoma which produces an antibody having specific binding affinity
to a kinase polypeptide selected from the group consisting of
SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ ID NO:127,
SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131, SEQ ID NO:132, SEQ ID NO:133,
SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136, SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139,
SEQ ID NO:140, SEQ ID NO:141, SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145,
SEQ ID NO:146, SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151,
SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156, SEQ ID NO:157,
SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161, SEQ ID NO:162, SEQ ID NO:163,
SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166, SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169,
SEQ ID NO:170, SEQ ID NO:171, SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175,
SEQ ID NO:176, SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181,
SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186, SEQ ID NO:187,
SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191, SEQ ID NO:199, SEQ ID NO:193,
SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196, SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199,
SEQ ID NO:200, SEQ ID NO:201, SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205,
SEQ ID NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211,
SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID NO:217,
SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID NO:223,
SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229,
SEQ ID NO:230, SEQ ID NO:231, SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235,
SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241,
and SEQ ID NO:242.

29. A method for identifying a substance that modulates kinase activity 
comprising: 

(a) contacting a kinase polypeptide selected from the group consisting of
SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ ID NO:127,
SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131, SEQ ID NO:132, SEQ ID NO:133,
SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136, SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139,
SEQ ID NO:140, SEQ ID NO:141, SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145,
SEQ ID NO:146, SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151,
SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156, SEQ ID NO:157,
SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161, SEQ ID NO:162, SEQ ID NO:163,
SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166, SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169,
SEQ ID NO:170, SEQ ID NO:171, SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175,
SEQ ID NO:176, SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181,
SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186, SEQ ID NO:187,
SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191, SEQ ID NO:199, SEQ ID NO:193,
SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196, SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199,
SEQ ID NO:200, SEQ ID NO:201, SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205,
SEQ ID NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211,
SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID NO:217,
SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID NO:223,
SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229,
SEQ ID NO:230, SEQ ID NO:231, SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235,
SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241,
and SEQ ID NO:242 with a test substance;

(b) measuring the activity of said polypeptide; and 

(c) determining whether said substance modulates the activity of said
polypeptide.

30. A method for identifying a substance that modulates kinase activity in a 
cell comprising: 

(a) expressing a kinase polypeptide in a cell, wherein said polypeptide
is selected from the group consisting of
SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ ID NO:127,
SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131, SEQ ID NO:132, SEQ ID NO:133,
SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136, SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139,
SEQ ID NO:140, SEQ ID NO:141, SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145,
SEQ ID NO:146, SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151,
SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156, SEQ ID NO:157,
SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161, SEQ ID NO:162, SEQ ID NO:163,
SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166, SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169,
SEQ ID NO:170, SEQ ID NO:171, SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175,
SEQ ID NO:176, SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181,
SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186, SEQ ID NO:187,
SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191, SEQ ID NO:199, SEQ ID NO:193,
SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196, SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199,
SEQ ID NO:200, SEQ ID NO:201, SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205,
SEQ ID NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211,
SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID NO:217,
SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID NO:223,
SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229,
SEQ ID NO:230, SEQ ID NO:231, SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235,
SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241,
and SEQ ID NO:242;

(b) adding a test substance to said cell; and 

(c) monitoring a change in cell phenotype or the interaction between
said polypeptide and a natural binding partner.

31. A method for treating a disease or disorder by administering to a patient in
need of such treatment a substance that modulates the activity of a kinase selected from
the group consisting of
SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ ID NO:127,
SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131, SEQ ID NO:132, SEQ ID NO:133,
SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136, SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139,
SEQ ID NO:140, SEQ ID NO:141, SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145,
SEQ ID NO:146, SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151,
SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156, SEQ ID NO:157,
SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161, SEQ ID NO:162, SEQ ID NO:163,
SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166, SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169,
SEQ ID NO:170, SEQ ID NO:171, SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175,
SEQ ID NO:176, SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181,
SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186, SEQ ID NO:187,
SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191, SEQ ID NO:199, SEQ ID NO:193,
SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196, SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199,
SEQ ID NO:200, SEQ ID NO:201, SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205,
SEQ ID NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211,
SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID NO:217,
SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID NO:223,
SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229,
SEQ ID NO:230, SEQ ID NO:231, SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235,
SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241,
and SEQ ID NO:242.

32. The method of claim 31, wherein said disease or disorder is selected from
the group consisting of immune-related diseases and disorders, cardiovascular disease,
neurodegenerative disorders, and cancer.

33. The method of claim 31, wherein said substance modulates kinase activity
in vitro.

34. The method of claim 33, wherein said substance is a kinase inhibitor.

35. A method for detection of a kinase polypeptide in a sample as a diagnostic
tool for a disease or disorder, wherein said method comprises: 

(a) contacting said sample with a nucleic acid probe which hybridizes
under hybridization assay conditions to a nucleic acid target region of a kinase polypeptide
selected from the group consisting of
SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ ID NO:127,
SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131, SEQ ID NO:132, SEQ ID NO:133,
SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136, SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139,
SEQ ID NO:140, SEQ ID NO:141, SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145,
SEQ ID NO:146, SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151,
SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156, SEQ ID NO:157,
SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161, SEQ ID NO:162, SEQ ID NO:163,
SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166, SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169,
SEQ ID NO:170, SEQ ID NO:171, SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175,
SEQ ID NO:176, SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181,
SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186, SEQ ID NO:187,
SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191, SEQ ID NO:199, SEQ ID NO:193,
SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196, SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199,
SEQ ID NO:200, SEQ ID NO:201, SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205,
SEQ ID NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211,
SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID NO:217,
SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID NO:223,
SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229,
SEQ ID NO:230, SEQ ID NO:231, SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235,
SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241,
and SEQ ID NO:242, said probe comprising the
nucleic acid sequence encoding said polypeptide, fragments thereof, or the complements
of said sequences and fragments; and

(b) detecting the presence or amount of the probe:target region hybrid
as an indication of said disease.

36. The method of claim 35, wherein said disease or disorder is selected from 
the group consisting of immune-related diseases and disorders, cardiovascular disease, 
neurodegenerative disorders, and cancer. 

37. A method for detection of a kinase polypeptide in a sample as a diagnostic 
tool for a disease or disorder, wherein said method comprises: 

(a) comparing a nucleic acid target region encoding said kinase
polypeptide in a sample, wherein said kinase polypeptide is selected from the group
consisting of
SEQ ID NO:122, SEQ ID NO:123, SEQ ID NO:124, SEQ ID NO:125, SEQ ID NO:126, SEQ ID NO:127,
SEQ ID NO:128, SEQ ID NO:129, SEQ ID NO:130, SEQ ID NO:131, SEQ ID NO:132, SEQ ID NO:133,
SEQ ID NO:134, SEQ ID NO:135, SEQ ID NO:136, SEQ ID NO:137, SEQ ID NO:138, SEQ ID NO:139,
SEQ ID NO:140, SEQ ID NO:141, SEQ ID NO:142, SEQ ID NO:143, SEQ ID NO:144, SEQ ID NO:145,
SEQ ID NO:146, SEQ ID NO:147, SEQ ID NO:148, SEQ ID NO:149, SEQ ID NO:150, SEQ ID NO:151,
SEQ ID NO:152, SEQ ID NO:153, SEQ ID NO:154, SEQ ID NO:155, SEQ ID NO:156, SEQ ID NO:157,
SEQ ID NO:158, SEQ ID NO:159, SEQ ID NO:160, SEQ ID NO:161, SEQ ID NO:162, SEQ ID NO:163,
SEQ ID NO:164, SEQ ID NO:165, SEQ ID NO:166, SEQ ID NO:167, SEQ ID NO:168, SEQ ID NO:169,
SEQ ID NO:170, SEQ ID NO:171, SEQ ID NO:172, SEQ ID NO:173, SEQ ID NO:174, SEQ ID NO:175,
SEQ ID NO:176, SEQ ID NO:177, SEQ ID NO:178, SEQ ID NO:179, SEQ ID NO:180, SEQ ID NO:181,
SEQ ID NO:182, SEQ ID NO:183, SEQ ID NO:184, SEQ ID NO:185, SEQ ID NO:186, SEQ ID NO:187,
SEQ ID NO:188, SEQ ID NO:189, SEQ ID NO:190, SEQ ID NO:191, SEQ ID NO:199, SEQ ID NO:193,
SEQ ID NO:194, SEQ ID NO:195, SEQ ID NO:196, SEQ ID NO:197, SEQ ID NO:198, SEQ ID NO:199,
SEQ ID NO:200, SEQ ID NO:201, SEQ ID NO:202, SEQ ID NO:203, SEQ ID NO:204, SEQ ID NO:205,
SEQ ID NO:206, SEQ ID NO:207, SEQ ID NO:208, SEQ ID NO:209, SEQ ID NO:210, SEQ ID NO:211,
SEQ ID NO:212, SEQ ID NO:213, SEQ ID NO:214, SEQ ID NO:215, SEQ ID NO:216, SEQ ID NO:217,
SEQ ID NO:218, SEQ ID NO:219, SEQ ID NO:220, SEQ ID NO:221, SEQ ID NO:222, SEQ ID NO:223,
SEQ ID NO:224, SEQ ID NO:225, SEQ ID NO:226, SEQ ID NO:227, SEQ ID NO:228, SEQ ID NO:229,
SEQ ID NO:230, SEQ ID NO:231, SEQ ID NO:232, SEQ ID NO:233, SEQ ID NO:234, SEQ ID NO:235,
SEQ ID NO:236, SEQ ID NO:237, SEQ ID NO:238, SEQ ID NO:239, SEQ ID NO:240, SEQ ID NO:241,
and SEQ ID NO:242, or one or more fragments thereof; with a control nucleic
acid target region encoding said kinase polypeptide, or one or more fragments thereof; and

(b) detecting differences in sequence or amount between said target 
region and said control target region, as an indication of said disease or disorder. 

38. The method of claim 37, wherein said disease or disorder is selected from 
the group consisting of immune-related diseases and disorders, cardiovascular disease, 
neurodegenerative disorders, and cancer. 
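
As a rough illustration only, the comparison recited in step (b) of claim 37 (detecting differences in sequence or amount between a target region and a control target region) might be sketched as follows. The function names `sequence_differences` and `amount_ratio` are hypothetical, and the sketch assumes the two regions are already aligned to equal length and that "amount" is available as a single numeric signal; neither assumption comes from the claims.

```python
from typing import List, Tuple


def sequence_differences(target: str, control: str) -> List[Tuple[int, str, str]]:
    """Return (1-based position, control base, target base) for each mismatch.

    Assumption (not from the claims): both regions are pre-aligned and of
    equal length, so a position-by-position comparison is meaningful.
    """
    target, control = target.upper(), control.upper()
    if len(target) != len(control):
        raise ValueError("regions must be pre-aligned to equal length")
    return [
        (i, c, t)
        for i, (t, c) in enumerate(zip(target, control), start=1)
        if t != c
    ]


def amount_ratio(target_signal: float, control_signal: float) -> float:
    """Ratio of target to control signal, a hypothetical stand-in for 'amount'."""
    if control_signal <= 0:
        raise ValueError("control signal must be positive")
    return target_signal / control_signal
```

An empty list from `sequence_differences` together with a ratio near 1.0 would mean that no difference was detected under these simplifying assumptions; what counts as an indication of disease is left to the assay and is not modeled here.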
SEQ ID NO: 122_X69117_H BARK2_H
MADLEAVLADVSYLMAKEKSKATPAAiyVSKRIVLPEPSIRSV^ 

QKIGFLLFKDFCLNEINEAVPQVKFYEEIKEYEKLDNEEDRLCRSRQIYDAYIMKELLSC 

SHPFSKQAVEHVQSHLSKKQVTSTLFQPYIEEICESLRGDIFQKFMESDKFTRFCQWKNV 

ELNIHLTMNEFSVHRIIGRGGFGEVYGCRKADTGKiynfAMKC^ 

IMLSLVSTGDCPFIVCMTYAFHTPDKLCFILDUVINGGDLHYHLSQHGVFS 

IILGLEHVHNRFVVYRDLKPANILLDEHGHARISDLGLACDFSKKKPHA^ 

VLQKGTAYDSSADWFSLGCmFKLLRGHSPFRQHKTKDKHEIDRMTLTVNVELPDTF 

LKSLLEGLLQRDVSKRLGCHGGGSQEVKEHSFFKGVDWQHVYLQKYPPPLIPPRGEW 

DAFDIGSFDEEDTKGIKLLDCDQELYKNFPLVISERWQQEVTETVYEAVNADTDKIEARK 

RAKNKQLGHEEDYALGKDCIMHGYMLKLGNPFLTQWQRRYFYLFPNRLEWRGEGESRQNL 

LTMEQILSVEETQIKDKKCILFRIKGGKQFVLQCESDPEFVQWKKELNETFKEAQRLLRR 

APKFLNKPRSGTVELPKPSLCHRNSNGL 

SEQ ID NO: 123_AA144574_M BARK2_M 

CFVVYRDLKPANILLDEYGHVRISDLGLACDFSKKKPHASVGTHGYMAPEVLQKGTCYDS 
SADWFSLGCI^FKLLRGHSPFRQHKTKDKHEIDRMTLTVNVQLPDAFSPELRSLLEGLLQ 
RDVSQRLGCGGGGARELKEHIFFKGIDWQHVYLRKYPPPLIPPRGEVNAADAFDIGSFDE 
EDTKGI KLLDCDQDLYKNFPLVI SERWQQEWETI YDAVNADTDKI EARKKAKNKQLGQE 
EDYAMGKDCIMHGYMLKLGNPFLTQWQRRYFYLFPNRLEWRGEGESRQSLLTMEQIMSVE 
ETQIKDRKCIIiliRIKGGKQFVLQCESDPEFAQWLKEIiTCTFNEAQRLLRRAPKFLNKPRA 
AILEFSKPPLCHRNSSGL 

SEQ ID NO: 124_AA826850_H

MGS SMSAATARRPVFDDKEDVNFDHFQI LRAIGKGSFGKVCI VQiqiDTEKMYAMKy^ 

QCIERDEVRNVFRELEILQEIEH^^VNLWYSFQDEEDMFIWVDLLLGGDIiRYHL^ 

FSEDTVRLYI CEMALALDYLRGQHI IHRDVKPDNILLDERGHAHLTDFNI AT 

TALAGTKPYMAPEIFXSFWGGTGYSFEVDWWSVGVMAYELLRGWRPYDIHSSNAVESLV 

QLFSTVSVQYVPTWSKEMVALLRKI^TVNPEHRLSSLQDVQAAPALAGVLraH^ 

PGFVPNKGRLHCDPTFELEEMI LESRPLHKKKKRLAKNKSRDNSRDS SQSENDYLQDCLD 

AIQQDFVIFNREKLKRSQDLPREPLPAPESRDAAEPVEDEAERSALPMCGPICPSAGSG 

SEQ ID NO: 125_AA960957_H 

MGGNHSHKPPVFDENEEVNFDHFQILRAIGKGSFGKVCIVQKRDTKK^^AMKYMNKQKC 

ERDEVRNVFRELQIMQGLEHPFLVNLWYSFQDEEDMFMVVDLLLGGDLRYHLQQNVHFTE 

GTVKLYICELALALEYLQRYHIIHRDIKPDNILLDEHGHVHITDFNIATVVKGAER^ 

AGTKPYMAPEVFQVYMDRGPGYSYPVDWWSLGITAYELLRGWRPY^IHSWPIDEILNM 

KVERVHYSSTWCKGMVALLRKLLTKDPESRVSSLHDIQSVPYLADMNra 

VPNKGRLNCDPTFELEEMILESKPLHKKKKRLAKNRSRDGTKDSCPLNGHLQHCLEl^ 

EFIIFNREKLRRQQGQGSQLLDTDSRGGGQAQSKIiQDGCNimiLTHTCTRGCSS 

SEQ ID NO: 126_TBK1_H 

MQSTSNHLWLLSDILGQGATANVFRGRHKKTGDLFAIKVFNNISFLRPVDVQMRE 

KLNHKNIVKLFAIEEETTTRHKVLIMEFCPCGSLYTVIiEEPSNAYGLPESEFLI 

GGMNHLRENGIVHRDIKPGNIMRVIGEDGQSVYKLTDFGAARELEDDEQFVSLYG^ 

HPDI^ERAVLRKDHQKKYGATVDLWS IGVTFYHAATGSLPFRPFEGPRRNKEVMYKI I TG 

KPSGAISGVQKAENGPIDWSGDMPVSCSLSRGLQVLLTPVLANILEADQEKCWGFDQFFA 

ETSDILHRMVIHVFSLQQMTAHKIYIHSYNTATIFHELVYKQTKIISSNQELIYEGRRLV 

LEPGRI^QHFPKTTEENPIFWSREPLNTIGLIYEKISLPKVHPRYDLDGDASMAKAITG 

WCYACRIASTIiLLYQELMRKGIRWLIELIKDDYNETVHKKTEWITLDFCIRNIEKTVK 







FIGURE 1B

VYEKIJ^KINLEAAELGEISDIHTKLLRLSSSQGTIETSLQDIDSRLSPGGSLADAWA^ 

GTHPKDRNVEKLQVLLNCiyrrEIYYQFKKDKAERRLAyNEEQIHKFDKQK^ 

FTDECVKKYEAFl^KSEEWIRKMLHLRKQLLSLTNQCFDIEEEVSKYQEY^^ 

lO^FTASSGIKHTMTPIYPSSNTLVEMTLXSMKKLKEEMEGWKELAENN^ 

GGLRNVDCL 

SEQ ID NO: 127_AA305176_H 

MDPTAGSKKEPGGGAATEEGVNRIAVPKPPSIEEFSIVKPISRGAFGKVYLGQKGGKLYA 
VKVVKKADMINKNMTHQVQAERDAIiALSKS PF I VHL YYSLQSANNVYLVME YL I GGDVKS 
LLHIYGYFDEEI^VKYISEVALALDYLHRHGIIHRDLKPDNML 
TlilJnUDINMMDILTTPSMAKPRQDYSRTPGQVLSLISSIXSFOT 

ETSQLSQGLVCPMSVDQKDTTPYSSKLLKSCLETVASNPGMPVKCLTSl^LQSRKRIiATS 
SASSQSHTFISSVESECHSSPKWEKDCQV 

SEQ ID NO: 128_AA116841_M 

TRPIPWPEGEEKLSDNAQSAIsroMLLTIDDSKRAGMRELKQHPLFSEVDWENIiQHQTMPFV 
PQPDDETDTSYFEARNNAQHLTVSGFSL 

SEQ ID NO: 129_AA256100_H 

MAMTAGTTTTFPMSNHTRERVWAKLTLENFYSNLILQHEERETRQK^ 
EEKKLRRSQHARKETEFLRLKRTRLGIiDDFESLKVIGRGAFGEVRLVQKKDTGHIYAMKI 
LRKSDMLEKEQVAHIRAERDILVEADGAWWKMFYSFQDKRNLYLIMEFLPGGDMMTLLM 
KKDTLTEEETQFYI SETVLAIDAIHQLGFIHRDI KPDNLLLDAKGHVKLSDFGLCTGLKK 
AHRTEFYRNLTHNPPSDFSFQNMNSKRKAETWKiCNRRQLAYSTVGTPDYIAPE^ 
NKLCDWWSLGVIMYEMLIGYPPFCSETPQETYRKVIWWKETLVFPPEVPISEKAKDLILR 
FCIDSENRIGNSGVEEI KGHPFFEGVDWEHI RERPAAI P lEI KS IDDTSNFDDFPESDIL 
QPVPNTTEPDYKSKDWVFLNYTYKRFEGLTQRGS I PTYMKAGKL 

SEQ ID NO: 130_AA210825_H

DSLLPTPALGTPLPIPWPVGSLRTPLSLESTRSPTQRIiLPSTPKDPAILRSPPPARSFLG 

SPLSHHLLTRSRGSRTQGPPGPPGGSRVGSRRAVPGLPPWPPPPHYPAGLPGSPGPGSPP 

PPGGLELQSPPPLLPQI PAPGSGVSFHIQIGLTREFVLLPAASELAHVKQLACS I VDQKF 

PECGFYGLYDKILLFKHDPTSANLLQLVRSSGDIQEGDLVEWLSASATFEDFQIRPHAL 

TVHSYRAPAFCDHCGEMLFGLWQGLKCDGCGLNYHKRCAFSIPNNCSGARKRRLSSTSL 

ASGHSVRLGTSESLPCTAEELSRSTTELLPRRPPSSSSSSSASSYTGRPIELDKMLLSKV 

KVPHTFLIHSYTRPTVCQACKKLLKGLFRQGLQCKDCKFNCHKRCATRVPNDC^GEALIN 

GDVPMEEATDFSEADKSALMDESEDSGVIPGSHSENALHASEEEEGEGGKAQSSLGYIPL 

MRWQSVRHTTRKSSTTLREGWVVHYSNTOTLRKRHYWRLDCKCITLFQl^^ 

PLSEILTVESAQNFSLVPPGTNPHCFEIVTANATYFVGEMPGGTPGGPSGQGAEAARGIiX 

ETAIRQALMPVILQDAPSAPGHAPHRQASLSISVSNSQIQENVDIATVYQIFPDEVLGSG 

QFGVVYGGKHRKTGRDVAVKVIDKLRFPTKQESQLRNEVAILQSLRHPGIVNLECMFETP 

EKVFVVMEKLHGDMLEMILSSEKGRLPERLTKFLITQILVALRHLHFKNIVHCD^ 

LLASADPFPQVKLCDFGFARIIGEKSFRRSWGTPAYLAPEVLLNQGYNRSLDMWSVGVI 

14YVSLSGTFPFNEDEDINDQIQNAAFMYPASPWSHISAGAIDLINNLLQVTCMR 

SLSHPWLQEYQTWLDLRELEGKMGERYITHESDDARWEQFAAEHPLPGSGLPTDRDLGGA 

CPPQDHDMQGLAERI SVL 

SEQ ID NO: 131_AA127299_H

IQFIIVGAKDLLAJmSNGLSDPYIKITNLSQKTKVIKKTLTPTi^^ 
ECWDHDTFSDDFIGKASISLAEIPALAEVDMWIDMKTKKGEFAGK

FIGURE 1C



SEQ ID NO: 132_AA316804_H 

MSANNSPPSAQKSVLPTAI PAVLPAAS PCS S PKTGLSARLSNGS FS APSLTNSRGS VHTV 

SFLLQIGLTRESVTIEAQELSLSAVKDLVCSIVYQKFPECGFFGMYDKILLFRHDMNSEN 

ILQLITSADEIHEGDLVEVVLSAIATVEDFQIRPHTLYVHSYKAPTFCDYCGEMLWGLVR 

QGLKCEGCGLNYHKRCAFKIPNNCSGVRKRRLSNVSLPGPGLSVPRPLQPEYVALPSEES 

HVHQEPSKRIPSWSGRPIWIffiKMVMCRVKVPHTFAVHSYTRPTICQYCKRLLKGLFRQGM 

QCKDCKFNCHKRCASKVPRDCLGEVTFNGEPSSLGTDTDIPMDIDNNDINSDSSRGLDDT 

EEPSPPEDKMFFLDPSDLDVERDEEAVKTISPSTSNNIPLMRWQSIKHTKRKSSTMVKE 

GWMVHYTSRDNIJIKRHYWRLDSKCLTLFQNESGSKYYKEIPLSEILRISSPRDFTNISQ^ 

SNPHCFEIITDT^WYFVGE1WGDSSHNPVI^TGVGLDVAQSWEKAIRQALMPVTPQAS 

CTSPGQGKDHKDLSTSISVSNCQIQENVDISTVYQIFADEVLGSGQFGIVYGGKHRKTGR 

DVAIKVIDKMRFPTKQESQLRNEVAILQNLHHPGIWLECMFETPERVFVVMEKLHGDML 

EMILSSEKSRLPERITKFMVTQILVALRISn^HFKNIVHCDLKPEmnjLAS^ 

FGFARI IGEKSFRRSWGTPAYLAPEVLRSKGYNRSLDMWSVGVI I YVSLSGTFPFNEDE 

DINDQIQNAAFMYPPNPWREISGEAIDLINNLLQVKMRKRYSVDKSLSHPWLQDYQTWLD 

LREFETRIGERYITHESDDARWEIHAYTHNLVYPKHFIMAPNPDDMEEDP 

SEQ ID NO: 133_PKNBETA_H 

MEEGAPRQPGPSQWPPEDEKEVIRRAIQKELKIKEGVENLRRVATDRRHLGHVQQLLRSS 
NRRLEQLHGELRELHARILLPGPGPGPAEPVASGPRPWAEQLRARHLEAIJIRQLHVELKV 
KQQAENl^miTCASGTPKERKLIiAAAQQMLRDSQLKVALLRMKISSLEASGSPEPGPELLA 
EELQHRLHVEAAVAEGAKNVVKLLSSRRTQDRKALAE^ 

EQLPPAHPLRSRVTRELRAAVPGYPQPSGTPVKPTALTGTLQVRLLGCEQLLTAVPGRSP 

AAAIASSPSEGWLRTKAKHQRGRGEIASEVIAVLKVDNRWGQTGWGQVAEQSWDQTFVI 

PLERARELEIGVHWRDWRQLCGVAFLRLEDFUDNACHQLSLSLVPQGLLFAQVTFCDPVI 

ERRPRLQRQERI FSKRRGQDFLRRSQMNLGMAAWGRLVMNLLPPCS SPSTI SPPKGCPRT 

PTTLREASDPATPSNFLPKKTPLGEEMTPPPKPPRLYLPQEPTSEETPRTKRPHMEPRTR 

RGPSPPASPTRKPPRLQDFRCLAVLGRGHFGKVLLVQFKGTGKYYAIKALKKQEVLSR^ 

lESLYCEKRILEAVGCTGHPFLLSLLVCFQTS&HARFVTEFVPGGDLMMQIHEDVFPEPQ 

ARFYVACVVLGLQFLHEKKIIYRDLKLDNLLLDAQGFLKIADFGLCKEGIGFGDRTSTFC 

GTPEFIAPEVLTQEAYTQAVDVWALGVLLYEMLVGECPFPGDTEEEVFDCIVNiy© 

FLSVQGLEFIQKLLQKCPEKRLGAGEQDAEEIKVQPFFRTTNWQALLARTIQPPFVPTLC 

GPADLRYFEGEFTGLPPALTPPAPHSLLTARQQAAFRDFDFVSERFLEP 

SEQ ID NO: 134_AI021023_M PKNBETA_M 

LKTONLLLDAQGFLKIADFGLCKEGIGFGDRTSTFCGTPEFLAPEVLTQEAYTRAVDWWG 
LGVLLYEMLVGECPFPGDTEEEVPDCIVNMDAPYPGFLSVQGLEFIQKLLQKCPEKRLGA 
GEQDAEEIKVQPFFRTTNWQALLARTIQPPFVPTLCGPADLRYFEGEFTGLPPALTPPAP 
HSLLTARQQAAFRDFDFVSERFLEP 

SEQ ID NO: 135_H19102_H 

GGNIRGPWARGWKSLWTGLGTIRSDLEELWELRGHHYLHQESLKPAPVLVEKPLPEWPVP 

QFINLFLPEFPIRPIRGQQQLKILGLVAKGSFGTVLK\n:iDCTQKAVFAVKWPKVKV^ 

DTVRQCKEEVS IQRQINHPFVHSLGDSWQGKRHLFIMCSYCSTDLYSLWSAVGCFPEAS I 

RLFAAELVLVLCYLHDLGIMHRDVKMENILLDERGHLKLTDFGLSRHVPQGAQAYTICGT 

LQYMAPEVLSGGPYNHAADVmSLGVLLFSLATGKFPVAAERDHVAMLASVTHSDSEIPAS 

LNQGLSLLLHEIiLCQNPLHRLRYLHHFQVHPFFRGVAFDPELLQKQPVNFVTETQATQPS 

SAETMPFDDFDCDLESFLLYPIPA 
FIGURE 1D



SEQ ID NO: 136_AA476563_H 

MEFFRIDSKDSASELLGLDFGEKLYSLKSEPLKPFFTLPDGDSASRSFNTSESKVEFKAQ 

DTI SRGSDDSVPVI SFKDAAFDDVSGTDEGRPDLLVNLPGELESTREAAAMGPTKFTQTN 

IGIIENKLLEAPDVLCLRLSTEQCQAHEEKGIEELSDPSGPKSYSITEKHYAQEDPRMLF 

VAAVDHSSSGDMSLLPSSDPKFQGLGWESAVTANNTEESLFRICSPLSGANEYIASTDT 

LKTEEVLLFTOQTDDLAKEEPTSLFQRDSETKGESGLVLEGDKEIHQIFEDLDKK^^ 

RFYIPEGCIQRWAAEMWALDALHREGIVCRDLNPNNILLJTO 

CDSDAIERMYCAPEVGAITEETEACTWWSLGAVLFELLTGKTLVECHPAGINTHTTI^ 
ECVSEEARSLIQQLLQFNPLERLGAGVAGVEDIKSHPFFTPVDWAELMR 

SEQ ID NO: 137_AA626690_H

MLPFAPQDEPVTOREMEVFSGGGASSGEWGLKMVDEPMEEGEADSCHDEGVVKEIPITHH 

VKEGYEKADPAQFELLKVLGQGSFGK\^LVRKKTGPDAGQLYAMKVLKKASLKVRDRVRT 

KMERDILVEVNHPFIVKLHYAFQTEGKLYLILDFLRGGDVFTRLSKEVLFT^ 

ELALALDHLHQLGIVYRDLKPENILLDEIGHIKLTDFGLSKESVDQEKKAYSFCGT^ 

APEVVNRRGHSQSADWWSYGVIiMFEMLTGTLPFQGKDRNETMNMILK^^ 

QSLLRMLFKRNPANRLGSEGVEEIKRHLFFANIDWDKLYKREVQPPFKPASGKPDDTFCF 

DPEFTAKTPKDSPGLPASANAHQLFKGFSFVATSIAEEYKITPITSANVLPIVQINGNAA 

QFGEVYELKEDIGVGSYSVCKRCIHATTNMEFAVKIIDKSKRDPSEEIEIIJ4RYGQHPNI 

ITLKDVFDDGRYVYLVTDLMKGGELLDRILKQKCFSEREASDILYVISKTVDYLHCQGVV 

HRDLKPSNI LYMDE SASADS I RI CDFGFAKQLRGENGLLLTPCYTANFVAPEVLMQQGYD 

AACDIWSLGVLFYTMLAGYTPFANGPNDTPEEILLRIGNGKFSLSGGNWDNISDGAKDLL 

SHMLHmPHQRYTAEQILKHSWITHRDQLPNDQPKRJSroVSHVVKG^ 

PVLEPVAASSLAQRRSMKKRTSTGL 

SEQ ID NO: 138_AA215680_H 

MSLVACECLPSPGLEPEPCSRARSQAHVYLEQIRNRVALGVPDMTKRDYLVDAATQIRIiA 

LERDVSEDYEAAFNHYQNGVDVLLRGIHVDPNKERREAVKLKITKYLRRAEEIFNCHLQR 

PLSSGASPSAGFSSLRLRPIRTLSSAVEQLRGCRWGVIEKVQLVQDPATGGTFWKSLP 

RCHIWSRERLTI I PHGVPYMTKLIiRYFVSEDS I FLHLEHVQGGTLWSHLLSQAHSRHSGL 

SSGSTQERMKAQIJJPHLNLLTPARLPSGHAPGQDRIALEPPRTSPNLIJ^GEAPSra 

EAEGEPTARTSTSGSSDLPKAPGGHLHLQARRAGQNSDAGPPRGLTWVPEGAGPVLGGCG 

RGMDQSCLSADGAGRGCGRATWSVREEQVKQWAAEMLVALEALHEQGVLCRDLHPGNLLL 

DQAGHIRLTYFGQWSEVEPQCCGEAVDNLYSAPEVGGISELTEACDWWSFGSLLYELLTG 

MALSQSHPSGIQAHTQLQLPEWLSRPAASLLTELLQFEPTRRLGMGEGGVSKIiKSHPFFS 

TIQWSKLVG 

SEQ ID NO: 139_SGK_H

MTVICTEAAKGTLTYSRMRGMVAILIAFMKQRRMGLNDFIQKIANNSY^^ 

SQPQEPELMNANPSPPPSPSQQINLGPSSNPHAKPSDFHFLKVIGKGSFGKVLLARHKAE 

EVFYAVKVLQKKAILKKKEEKHIMSERNVLLKNVKHPFLVGLHFSFQTADKLYFVI^ 

GGELFYHLQRERCFLEPRARFYAAEIASALGYLHSLNIVYRDLKPENILLDSQGHIVLTD 

FGLCKENIEHNSTTSTFCGTPEYLAPEVLHKQPYDRTVDWWCLGAVLYEMLYGLPPFYSR 

NTAE^ra3NILNKPLQLKPNITNSARHLLEGLLQKDRTKRLGAKDD^ 

DDLINKKITPPFNPNVSGPNELRHFDPEFTEEPVPNSIGKSPDSVLVTASVKEAAEAFLG 

FSYAPPTDSFL 

SEQ ID NO: 140_AA107515_M 

MTVKAEAARS TLTYSRMRGMVAI LI AFMKQRRMGLNDFI QKI ASNTYACKHAE VQS I LKM 
SHPQEPEIJ^ANPSPPPSPSQQINLGPSSNPHAKPSDFHFLKVIGKGSFGKVLIiARHKAE 
FIGURE 1E



EVFYAVKVLQKKAILKKKEEKHIMSERNVLLKmrKHPFLVGLHFSFQTADKL^ 

GGELFYHLQRERCFLEPRARFYAAEIASALGYLHSLNIVYRDLKPENILLDSQGHIVLTD 

XFQLRRIEHNGTTSTFCGTPEyiAPEVLHKQPYDRTVDWWCLGAVLYEMLYGLPPFYSRN 

TAEMYDNILNKPLQLKPNITNSARHLLEGLLQKDRTKRLGAKDDFMEIKSHIFFSLINWD 

DLINKKITPPFNPNVSGPSDLRHFDPEFTEEPVPSSIGRSPDSILVTASVKEAAEAFLGF 

SYAPPVDSFL 

SEQ ID NO: 141_AA109508_M 

HLQRERRFLEPRARFYAAEVASAIGYLHSLNIIYRDLKPENILLDCQGHVVLTDFGLCKE 

GVEPEDTTSTFCGTPEYLAPEVLRKEPYDRAA/DWWCLGAVLYEMLHGLPPFYSQDVSQMY 

ENILHQPLQIPGGRTVAAC1)LLQSLLHKI)QRQRLGSKADFLEIKNHVFFSPINWD 

RLTPPFNPNVTGPADLKHFDPEFTQEAVSKSIGCTPDTVASSSGASSAFLGFSYAPEDDD 

ILDC 

SEQ ID NO: 142_AA887783_H 

MQRDHTMDYKESCPSVXIPSSDEHREKKKRFTVYKVIiVSVGRSEWFVFRRYAEFDK^ 
LKKQFPAXALKI PAKRIFGDNFDPDFI KQRRAGLNEFIQNLVRYPELYNHPDVRAFLQMD 
SPKHQSDPSEDEDERSSQKLHSTSQNINLGPSGNPHAKPTDFDFLKVIGKGSFGKVLLAK 
RKLDGKFYAVKVLQKKIVLNRKEQKHIMAERNVLLKNVKHP 

DFVNGGEGHVVLTDFGLCKEGIAISDTTTTFCGTPEYLAPEVIRKQPYDNTVD^ 
LYEMLYGLPPFYCRDVAEMYDNILHKPLSIiRPGVSLTAWSILEELLEKDRQNRLGAKEDF 
LEIQNHPFFESLSWADLVQKKIPPPFNPNVAGPDDIRNFDTAFTEETVPYSVCVSSDYSI 
VNASVLEADDAFVGFSYAPPSEDLFL 

SEQ ID NO: 143_R47805_H 

MAHQTGIHATEELKEFFAKARAGSVRLIKWIEDEQLVLGASQEPVGRWDQDYDRAVL 

LDAQQPCYLLYRLDSQNAQGFEWLFLAWSPDNSPWLKMLYAATRATVKKEFGGGHIKDE 

LFGTVKDDLSFAGYQKHLSSCAAPAPLTSAERELQQIRINEVKTEISVESKHQTLQGLAF 

PLQPEAQRALQQLKQKMVNYIQMKLDLERETIELVHTEPTDVAQLPSRVPRDAARYHF^^ 

YKHTHEGDPLESWFIYSMPGYKCSIKERMLYSSCKSRLIiDSVEQDFHLEIAKKIEIGDG 

AELTAEFLYDEVHPKQH/IFKQAFAKPKGPGGKRGHKRLIRGPGENGDDS 

SEQ ID NO: 144_H60215_H 

MSKLRMKRRASDRGAGETSARAKALGSGISGNNAKRAGPFILGPRLGNSPVPSIVQCIJ^ 
KDGTDDFYQLKILTLEERGDQGIESQEERQGKMLLHTEYSLLSLLHTQDGWHHHGLFQD 
RTCEIVEDTESSRMVKKMKKRICLVLDCLCAHDFSDKTADLINLQHYVIKEKRLS 
VIFYDVVRVVEALHQKlJIVHRDLKLGNMVIiNKRTHRITITO 

SPAYISPDVLSGRPYRGKPSDMWALGVVLFTMLYGQFPFYDSIPQELFRKIKAAEYTIPE 
DGRVSENTVCLIRKLLVLDPQQRLAAADVLEALSAIIASWQSLSSLSGPLQWPDIDDQM 
SNMDSSQEAIOrraECSQYEFENYMRQQLLIiAEEKSSIHDTRSWVPKRQF 
DAQPMTSLDTAILAQRYLRK 

SEQ ID NO: 145_SGK324_H 

MASTRSIELEHFEERDKRPRPGSRRGAPSSSGGSSSSGPKGNGLIPSPAHSAHCSFYRTR 

TLQALSSEKKAKKARFYRNGDRYFKGLVFAISSDRFRSFDALLIELTRSLSDNVNLPQGV 

RTIYTIDGSRKVTSLDELLEGESYVCASNEPFRKVDYTKNINPNWSVNIKGGTSRALAAA 

SSVKSEVKESKDFIKPKLVTVIRSGVKPRKAVRILLNKKTAHSFEQVLTDITEAIK^ 

VVKRLCTLDGKQVRVTCVHLPDFFGDDDVFIACGPEKFRYAQDDFVLDHSECRVLKSSYS 

RSSAVKYSGSKSPGPSRRSQISAHGRSSSNVNGGPELDRCISPEGVNGNRCSESSTLLEK 

YKIGKVIGDGNFAWKECIDRSTGKEFALKIIDKAKCCGKEHLIENEVSILRRVKHPNII 
FIGURE 1F

MLVEEMETATELFLVMELVKGGDLFDAITSSTKYTERDGSAMVYNIANALRYLHGL 

RDIKPENLLVCEYPDGTKSLKLGDFGLATVVEGPLYWCGTPTYVAPXIIAETGYGLK^ 

IWAAGVITYILLCGFPPFRSENNLQEDLFDQILAGKLEFPAPYWDNITDSAKELISQMLQ 

VNVEARCTAGQILSHPWVSDDASQENNMQAEVTGKLKQHFNNALPKQNSTTTGVSVIMVS 

GRRQVWPDCGAGLEVFELGSRELPSHGSWCLP 

SEQ ID NO: 146_W30246_M SGK324_M 

TKSSSSSPTSPGSFRGLKISAQGRSSSNWGGPELDRCLSPEGVNCSSTRCSESFPLLEKYR 
IGKVIGDGNFAWKECVDRYTGKEFALKI IDKAKCCGKEHLIENEVSILRRVKHPNI IML 
VEEMETATDLFLVMELVKGGDLFDAITSSTKYTERDGSAlvrraJLANALR^ 
IKPENLLVCEYPDGTKSLKLGDFGLATWEGPLYTVCGTPTYVAPE 1 1 AETGYGLKVDVW 
AAGVITYILLCGFPPFRSENNLQEDLFDQILAGKLEFPAPYWDNITDSPCVCFRKCL 

SEQ ID NO: 147_AA383293_H

PAAKRVWYRNGDPFFPGSQLWTQRRFPTMEAFLCEVTSAVQAPIiAVRALYTPCHGHPV 

TNLADLKNRGQYVAAGFERFHKLPPYQAFCLSVFRNGDLVS PPFSLKLSQAASQDWETVL 

KLLTEKyKLQSGAVRLCTLEGLPLSAGKELVTGHYYVAVGEDEFKDLPYPALSTRGLLAA 

GNEAHLRSGVGTVAGSPKPLGRKAKKETCLIVTLTLKYQQSETSRDGQSFPSGVIGVYGA 

PHRRKETAGALEVADDEDTQTEEPLDQRAAQIVEQVTCLQDFFGDDDVFIACGPEKFRYA 

QDDFVLDHSRRRLLREHQAGFEKLRRTRGEEKEAEKEKKPCMSGGRRMTLRDDQPAK^ 

EPKTRPEENKPERPSGRKPRPMGIIAANVEKHYETGRVIGDGNFAWKECRHRETRQAYA 

MKIIDKSRLKGKEDMVDSEILIIQSLSHPNIVKLHEVYETDMEIYLILEYVQGGDLFDAI 

lESVKFPEPDAALMIMDLCKALVHMHDKSIVHRDLKPENLLVQRNEDKS 

KHWRPIFTVCGTPTYVAPEILSEKGYGLEVDMWAAGVILYIIiLCGFPPFRSPXXGDQDE 

LFNI IQLGHFEFLPPYWDNI SDAAKDLVSRLLWDPKKRYTAHQVLQHPWIETAGKTNTV 

KRQKQVSPSSDGHFRSQHKRWEQVS 

SEQ ID NO: 148_AA197883_M

MPTAPVLRPPPPPATPAPPAPSRPAPPIPGHRGPCDHSLKCLSSKISERKLPGPWLPAGR 

GPLEKPVI^PRGAVMPLFSPQSSLHSVRAEHSPLKPRVVTWKLGGQPLjRKATLLI^^ 

VQTFEQLLSDI SEALGFPRWKNDRVRKLFTLKGREVKS VSDFFREGDAFI AMGKE PLTLK 

SIQIJ^ELYPKNRALALAPHSRVPSPRLRSRLPSKIiLKGSHRCGEAGSYSAEM^ 

RHQGKTSTVIiAPEDKARAQKVmiGKQESEPGGPPSPGAATQEETHASGEKHLGA^ 

GEIVRCEKCKRERELQLGLQREPCPLGTSELDLGRAQKRDSEKLVRTKSCRRPSKAKFTD 

GEEGWKGDSHRGSPRDPPQEMRRPNSNSDKKEIRGSESQDSYPQGAPKAQKDFVEGPPAV 

EEGP I DMRREDRHTCRSKHAAWLRREQQAEPPQLPRTRGEEKQAEHEKKPGGLGERRAPE 

KES KRKLEEKRPERPSGRKPRPKGI I SADVEOYDI GGVI GDGNFATVKECRHRETKQAY 

AMKMIDKSQLKGKEDIVDSEILIIQSLSHPNIVKLHEVYETEAEIYLIMEYVQGGDLFDA 

IVENVKFPEPEAAVMITDLCKAFVHMHDKNIVHRDVTCPENLLVQRN^ 

AKYWRPIFTVCGTPTYVAPEILSEKGYGLEVDMWAAGVILYILLCGFPPFRSPERDQDE 

LFNIIQVGQFEFLSPYWDNISDAAKDLVRNLLEVDPKKRYTAEQVLQHPWIEMVGHTNTG 

NSQKEESPNSLGHFQSQHKKVAEQMP 

SEQ ID NO: 149_DRAK2_H 

MSRRRFDCRS I SGLLTTTPQI PI KMENFNNFYILTSKELGRGKFAVVRQCI SKSTGQEYA 
AKFLKKRRRGQDCRAEIIiHEIAVLELAKSCPRVINLHEVYENTSEIILILEYAAGGEIFS 
LCLPELAEMVSENDVIRLIKQILEGVYYLHQNNIVHLDIiKPQNILLSSIYPLGDIKIVDF 
GMSRKIGHACELREIMGTPEYIAPEILNYDPITTATDMWNIGIIAYMLLTHTSPFVGEDN 
QETYLNISQVNVDYSEETFSSVSQLATDFIQSLLVKNPEKRPTAEICLSHSWLQQWDFEN 



FIGURE 1G

LFHPEETSSSSQTQDHSVRSSEDKTSKSSCNGTCGDREDKENIPEDSSMVSKRFRFDDSL 
PNPHELVSDLLC 

SEQ ID NO: 150_W44160_M DRAK2_M 

MSRRRFDCRSVSGLLTTTPQTPIKTENFmrFYTLTPKELGRGKFAVVRQCISKSTGQEYA 

AKSLKKRRRGQDCRAEI LHE I AVLELARSCPHVINLHEVYENATE 1 1 LVLE YAAGGE I FN 

LCLPELAEMVSENDVI RLI KQI LEGVHYLHQNNI VHLDLKPQNI LLS S I YPLGDI KI VDF 

GMSRKIGNASELREIMGTPEYLAPEILNYDPITTATDMWNIGIIAYMLLTHTSPFVGEDN 

QETYI2JISQVNVDYSEEMFSSVSQLATDFIQSIJ^VKNPEKRPTAESCLSHSW1X2QWD 

LFHPEETSGSSQIQDLTLRSSEEKTSKSSCNGSCGAREDKENIPEDGSLVSKRFRFDDSL 

PSPHELVPDLFC 

SEQ ID NO: 151_H01248_H, DRAK1_H 

MIPLEKPGSGGSSPGATSGSGRAGRGLSGPCRPPPPPQARGLLTEIRAWRTEPFQDGYS 
LCPGRELGRGKFAVVRKCIKKDSGKEFAAKFMRKRRKGQDCRMEIIHEIAVLEIAQDNPW 
VINIJIEVYETASEMILVLEYAAGGEIFDQCVADREEAFKEKDVQRLMRQILEGVHFLHTR 
DWHLDLKPQNILLTSESPLGDIKIVDFGLSRILKNSEELREIMGTPEYVAPEILSYDPI 
SMATDMWS I GVLTYVMLTGI S PFLGNDKQETFLNI SQMNLSYSEEEFDVLSES AVDF I RT 
LLVKKPEDRATAEECLKHPWLTQSSIQEPSFRMEKALEEANALQEGHSVPEINSDTDKSE 
TEESIVTEELIWTSYTLGQCRQSEKEKMEQKAISKRFKFEEPLLQEIPGEFIY 

SEQ ID NO: 152_AA021445_H 

MPARIGYYEIDRTIGKGNFAVVKRATHLVTKAKVAIKI IDKTQLDEENLKKI FREVQIMK 

MLCHPHIIRLYQVMETERMIYLVTEYASGGEIFDHLVAHGRMAEKEARRKFKQIV^ 

CHCRNIVHRDLKAENLLLDANLNIKIADFGFSNLFTPGQLLKTWCGSPPYAAPELFEGKE 

YDGPKVDIWSLGVVLYVLVCGALPFDGSTLQNLRARVLSGKFRIPFFMSTECEHLIRHM 

VLDPNKRLSMEQICKHKWMKLGDADPNFDRLIAECQQLKEERQVDPLNE^ 

DKEQTLQSLRSDAYDHYSAIYSLLCDRHKRHKin^GALPSMPRAIiAFQAPVN 

TAMNISVPQVQLINPENQIVEPDGTTiNLDSDEGEEPSPEALWYLSMRRHTVGVADPRTE 

VMEDLQKLLPGFPGVNPQAPFLQVAPNVNFMHNLIiPMQNLQPTGQLEYKEQSLLQPPTLQ 

LmGMGPLGRRASDGGANIQLHAQQIiLKRPRGPSPLVTiyiTPAVPAVTPV^ 

EAVQRYLANRSKIlHTLAMTNPTAEIPPDLQRQIiGQQPFRSR^ 

TLHLPTERF S PVRRFSDGAAS I QAFKAHLEKMGNNS S I KQLQQECEQLQKMYGGQ IDERT 
LEKTQQQHMLYQQEQHHQILQQQIQDSICPPQPSPPLQAACENQPALLTHQLQRLRIQPS 
SPPPNHPNNHLFRQPSNSPPPMSSAMIQPHGAASSSQFQGLPSRSAIFQQQPENCSSPPN 
VALTCLGMQQPAQSQQVTIQVQEPVDMLSNMPGTAAGSSGRGISISPSAGQMQMQHRTNL 
MATLSYGHRPLSKQLSADSAEAHSLNVNRFSPANYDQAHIiHPHIiFSDQSRGSPSSYSPST 
GVGFSPTQALKVPPLDQFPTFPPSAHQQPPHYTTSALQQALLSPTPPDYTRHQQVPHILQ 
GLLSPRHSLTGHSDIRLPPTEFAQLIKRQQQQRQQQQQQQQQQEYQELFRHMNQGDAGSL 
APSLGGQSMTERQALSYQNADSYHHHTSPQHLLQIRAQECVSQASSPTPPHGYAHQPALM 
HSESMEEDCSCEGAKDGFQDSKSSSTLTKGCHDSPLLLSTGGPGDPESLLGTVSHAQELG 
IHPYGHQPTAAFSKNKVPSREPVIGNCMDRSSPGQAVELPDHNGLGYPARPSVHEHHRPR 
ALQRHHTIQNSDDAYVQLDNLPGMSLVAGKALSSARMSDAVLSQSSLMGSQQFQDGENEE 
CGASLGGHEHPDLSDGSQHLNSSCYPSTCITDILLSYKHPEVSFSMEQAGV 

SEQ ID NO: 153_2R22-5-11_H

MTAVYMNGGGLVNPHYARWDRRDSVESGCQTESSKEGEEGQPRQLTPFEKLTQDMSQDEK 
VVREITLGKRIGFYRIRGEIGSGNFSQVKLGIHSLTKEKVAIKIII)KTKIjDQKTQRLLSR 
EISSMEKLHHPNIIRLYEVVETLSKLHLVMEYAGGGELFGKISTEGKLSEPESKLIFSQI 
VSAVKHl^ENQIIHRDLKAENVFYTSNTCVKVGDFGFSTVSKKGEiyQa^ 
FIGURE 1H



LFRDEHYI GI YVD I WJUjGVLLYFMVTGTMPFRAETVAKLKKS I LEGTYS VP PH^ 
LIRGVLQQIPTERYGIDCIMNDEWMQGVPYPTPLEPFQLDPKHLSETSTLKEEENEVKST 
LEHLGITEEHIRNNQGRDARSSITGVYRIILHRVQRKKALESVPVMMLPDPKERDL^ 
RVYRGI RHTSKFCS I L 

SEQ ID NO: 154_R31237_1_H, AAC33487 

MSTRTPLPTVNERDTENHTSHGDGRQEVTSRTSRSGARCRNSIASCADEQPHIGNYRLLK 

TIGKGNFAKVKIJU^ILTGREVAIKI IDKTQIiNPTSLQKLFREWIMKIL^

VIETEKTLYLIMEYASGGEVFDYLVAHGRMKEKEARSKFRQIVSAVQYCHQKRIVHRDLK 

AENLLLDADMNIKIADFGFSNEFTVGGKLDTFCGSPPYAAPELFQGKKYDGPEVDVWSLG 

VI LYTLVSGSLPFDGQNLKELRERVLRGKYRI PFYMSTDCENLLKRFLVLNP I KRGTLEQ 

IMKDRWINAGHEEDELKPFVEPELDISDQKRIDIMVGMGYSQEEIQESLSKMKYDEITAT 

YLLLGRKSSELDASDSSSSSNLSLAKVRPSSDLNNSTGQSPHHKVQRSVSSSQKQRRYSD 

HAGPAIPSWAYPKRSQTSTADGDLKEDGISSRKSSGSAVGGKGIAPASPMLGNASNPNK 

ADI PERKKSSTVPSSNTASGGMTRRNTYVCSERTTADRHSVIQNGKENSTI PDQRTPVAS 

THSISSAATPDRIRFPRGTASRSTFHGQPRERRTATYNGPPASPSLSHEATPLSQTRSRG 

STNLF S KLTSKLTRSRNVSAEQKDENKEAKPRSLRFTWSMKTTS SMDPGDMMRE I RKVLD 

ANNCDYEQRERFLLFCVHGDGHAENLVQWEMEVCKLPRLSLNGVRFKRISGTSIAFKNIA 

SKIANELKL 

SEQ ID NO: 155_W90839_M 

KGPSWSSRSLGARCRNSIASCPEEQPHVGNYRLLRTIGKGNFAKVKIiARHILTGREVAIK 

IIDKTQLNPSSLQKLFREVRIMKGLNHPNIVKLFEVIETEKTLYLVMEYASAGEVFDYLV 

SHGRMKEKEARAKFRQIVSAVHYOIQKNIVHRDLKAENLLLDAEANIKIA^ 

GSKLDTFCGSPPYAAPELFQGKKYDGPEVDIWSLGVILYTLVSGSLPFDGHNLKELRERV 

LRGKYRVPFYMSTDCESILRRFLVLNPAKRCTLEQIMKDKWINIGYEGEELKPDTELKEE 

RMPGRKASCS AVGSGSRGLPPS SPIWS SAHNPNKAE I PERRKDSTSTPNNLPPSMMTRRN 

TYVCTERPGSERPSLLPNGKENSSGTSRVPPASPSSHSLAPPSGERSRLARGSTIRSTFH 

GGQVRDRRAGSGSGGGVQNGPPAS PTLAHEAAPLP S GRPRPTTNLFTKLTS KLTRRVTDE 

PERIGGPEVTSCHLPWDKTETAPRIjLRFPWSVKLTSSRPS 

SEQ ID NO: 156_406786.5_H

MEVGGLTVFEEDQRCLSQSLPLPVSAEGPAAQTTAEPSRSFSSAHRHLSRRNGLSRLCQS 

RTALSEDRWSSYCLSSLAAQNICTSKLHCPAAPEHTDPSEPRGSVSCCSLLRGLSSGWSS 

PLLPAPVCNPNKAIFTVDAKTTEILVANDKACGLLGYSSQDLIGQKLTQFFLRSDSDWE 

ALSEEHMEADGHAAWFGTWDI ITRSGEKI PVSVWMKRMRQERRLCCVWLEPVERVST 

WVAFQSDGTITSCDSLFAHLHGYVSGEDVAGQHITDLIPSVQLPPSGQHIPKNLKIQRSV 

GRARDGTTFPLSLKLKSQPS S EEATTGE/^PVSGYRAS VWVFCTI SGLITLLPDGTIHG I 

NHSFALTLFGYGKTELLGKNITFLIPGFYSYMDLAYNSSLQLPDIASCLDVGN^ 

TLDPWQGQDPAEGGQDPRINVVIAGGHVVPRDEIRKLMESQDIFTGTQTELIAGGQLLSC 

LSPQPAPGVDNVPEGSLPVHGEQALPKDQQITALGREEPVAIESPGQDLLGESRSEPVDV 

KPFASCEDSEAPVPAEDGGSDAGMCGLCQKAQLERMGVSGPSGSDLWAGAAVAKPQAKGQ 

LAGGSLLMHCPCYGSEWGLWWRSQDLAPSPSGMAGLSFGTPTLDEPWLGVENDREE 

LIKEQLSQLSLAGALDVPHAELVPTECQAVTAPVSSCDLGGRDLCGGCTGSSSACYALAT 

DLPGGLEAVEAQEVDVNSFSWNLKELFFSDQTDQTSSNCSCATSELRETPSSLAVGSDPD 

VGSLQEQGSCVLDDRELLLLTGTCVDLGQGRRFRESCVGHDPTEPLEVCLVSSEHYAASD 

RESPGHVPSTLDAGPEDTCPSAEEPRLNVQVTSTPVIVMRGAAGLQREIQEGAYSGSCYH 

RDGLRLSIQFEVRRVELQGPTPLFCCWLVKDLLHSQRDSAARTRLFLASLPGSTHSTAAE 

LTGPSLVEVLRARPWFEEPPKAVELEGLAACEGEYSQKYSTMSPLGSGAFGFVWTAVDKG 

KNKEVVVKFIKKEKVLEDCWIEDPKLGKVTLEIAILSRVEHANIIKVLDIFENQGFFQLV 
FIGURE 1I

MEKHGSGLDLFAFIDRHPRLDEPIASYIFRQVRAGQSRLVSAVGYLRLKDIIHRDIKDEN 

IVIMDFTIKLIDFGSAAYLERGKLFYTFCGTIEYCAPEVLMGNPYRGPELEMWSLGVTL 

YTLVFEENPFCELEETVEAAIHPPYLVSKEIiMSLVSGLLQPVPERRTTLEKLVTDPWVTQ 

PVlSnjADYTWEEVFRVNKPESGVLSAASLEMGNRSLSDVAQAQEL 

HPGDPRLLTS 

SEQ ID NO: 157_AA544838_M 406786_M

TRPHPCLDEPLASFIFRQLVSAVGYLHSQGIIHRDIKDENIVIAEDFTIKLIDFGSAAYL 
ERGKLFYTFCGTI EYCAPEVLIGNPYRGPELEMWSLGVTLYTLI FEENPFCEVEETMEAV 
IHPPFLVSQELMSLLSGLLQPCPEQRTTLEKLIRDPWVTQPVNLASYTWEEVCRTNQPES 
GLLSAASLEIGSRSPSEMAQREGLCGPPAPRETRGDQHCLHLKDPSLPVS 

SEQ ID NO: 158_AA785735_H 

MVMADGPRHLQRGPVRVGFYDIEGTIiGKGNFAWKLGRHRITKTEVAI KI IDKSQLDAVN 

LEKIYREVQIMKMLDHPHIIKLYQVMETKSMLYLVTEYAKNGEIFDYLANHGRL^^ 

RKFWQILSAVDYCHGRKIVHRDLKAENLLLDNNMNIKIADFGFGNFFKSGELIiATWCGSP 

PYAAPEVFEGQQYEGPQLDIWSMGVVLYVLVCGALPFDGPTLPILRQRVLEGRFRIPYFM 

SEDCEHLIRRMLVLDPSKRLTIAQIKEHKWMLIEVPVQRPVLYPQEQENEPSIGEFNEQV 

LRLMHSLGIDQQKXIESLQNKSYNHFAAIYFLLVERLKSHRSSFPVEQRLDGRQRRPSTI 

AEQTVAKAQTVGLPVTMHSPNMRLLRSALLPQASNVEAFSFPASGCQAEAA^ 

PKA^GCLIJDPVPPVLVRKGCQSLPSNMMETSIDEGLETEGEAEEDPAHAFEAFQS^^ 

RRHTLSEVTNQLVVMPGAGKIFSMNDSPSLDSVDSEYDMGSVQRDLNFLEDNPSLKDIML 

TVNQPSPRMTSPFISLRPTNPAMQALSSQKREVHNRSPVSFREGRRASDTSLTQGIVAFRQ 

HLQNL/UITKGILELNKVQLLYEQIGPEADPNLAPAAPQLQDLASSCPQEE^ 

LPASVHPQLSPRQSLETQYLQHRLQKPSLLSKAQNTCQLYCKEPPRSLEQQLQEHRLQQK 

RLFLQKQSQLQAYFNQMQIAESSYPQPSQQLPLPRQETPPPSQQAPPFSLTQPLSPVLEP 

SSEQMQYSPFLSQYQEMQLQPLPSTSGPRAAPPLPTQLQQQQPPPPPPPPPPRQPGAAPA 

PliQFSYQTCELPSAASPAPDYPTPCQYPVDGAQQSDLTGPDCPRSPGLQEAPSSYDPLAL 

SELPGLFDCEMLDAVDPQHNGYVLVN 

SEQ ID NO: 159_AA207220_H

MESLVFARRSGPTPSAAEIARPIJVEGLIKSPKPLiMKKQAVKRHHHKHNLRHR^ 

KGTYGKVKKARESSGRLVAIKSIRKDKIKDEQDIJyiHIRREIEIMSSLNHPHIIAIHEV 

NSSKIVIVMEYASRGDLYDYISERQQLSEREARHFFRQIVSAVHYCHQNRWHRDLKLEN 

IliLDANGNIKIADFGLSNLYHQGKFLQTFCGSPLYASPEIVNGKPYTGPEVDSWSLGVLL 

YILVHGTMPFDGHDHKILVKQISNGAYREPPKPSDCLXGLIRWLLMVNPTRRATLEDVAS 

HWWVNWGYATRVGEQEAPHEGGHPGSDSARASMADWLRRSSRPLLENGAKVCSFFKQHAP 

GGGSTTPGLERQHSLKKSRKENDMAQSLHSDTADDTAHRPGKSNLKIjPKGILKKKVSASA 

EGVQEDPPELSPIPASPGQAAPLLPKKGILKKPRQRESGYYSSPEPSESGELLDAGDVFV 

SGDPKEQKPPQASGLLLHRKGILKLNGKPSQTALELAAPTTFGSLDEI^PPRPLARASRP 

SGAVSEDSILSSESFDQLDLPERLPEPPLRGCVSVDNLTGLEEPPSEGPGSCLRRWRQDP
LGDSCFSLTDCQEVTATYRQALRVCSKLT

SEQ ID NO: 160_AA426580_H, MAKV_H

MPAAAGDGLLGEPAAPGGGGGAEDAARPAAACEGSFLPAWVSGVPRERLRDFQHHKRVGN
YLIGSRKLGEGSFAKVREGLHVLTGEKVAIKVIDKKRAKKDTYVTK^

PNITQLLDI LETENS YYLVMELCPGGNLMHKI YEKKRLEESEARRYI RQLI SAVEHLHRA 
GVVHRDLKIENLLIiDEDNNIKLIDFGLSNCAGILGYSDPFSTQCGSPAYAAPELLTUIKI^ 
GPKIDWSIGVNMYAMLTGTLPFTVEPFSLRALYQKI^KEMNPLPTQLSTGAISFLRSL 
LEPDPVKRPNIQQALANRWLNENYTGKVPCNVTYPNRISLEDLSPSVVLH^ 
FIGURE 1J

DVINTVLSNRACHILAIYFLIiNKKLERYLSGKSDIQDSLCYKTRLYQIEKyRAPKESY 

SLDTWTRDLEFHAVQDKKPKEQEKRGDFLHRPFSKKLDKNLPSHKQPSGSLMTQIQOT 

LLKDRKASKSSFPDKDSFGCRNIFRKTSDSNCVASSSMEFIPVPPPRTPRIVKKPEPHQP 

GPGSTGIPHKEDPLiynJ^IWRSFESVDRDDHVEVLSPSHHYRILNSI^ 

PGLPSGSMSPLHTPLHPTLVSFAHEDKNSPPKEEGLCCPPPVPSNGPMQPLGSPNCVKSR 

GRFP^^4GIGQMLRKRHQSLQPSADRPLEASLPPIiQPIiAPWIiAFD^4ADGVKTQC 

SEQ ID NO: 161_Z36720_H 
MDTKLNMIJJEKVDQIiLHFQEDVTE^ 

DTQAGWPEVLELVRAMQQDAAQHGARLEALFRMVAAVDRAIALVGATFQKSKVADFL^ 

RVPWRRGSPGDSPEEWVKEEEVCFl^PVPPAPGAAGQSLQKDKGELSAEQGIWATLMTLV 

IMVTAANKERVEEEGGKPKHVLSTSGVQSDAREPGEESQKADVLEGTAERLPPIRASGLG 

ADPAQAWSPGQGIX?VPGPAQAFPGHLPLPTKVEAKAPETPSENLRTGLEIiAPAPGRVNV 

VSPSLEVAPGAGQGASSSRPDPEPLEEGTRLTPGPGPQCPGPPGLPAQARATHSGGETPP 

RAALLKGAVAPGFSRM5LWPS I FCACLGI SIHIQEMDTPGEMLMTGRGSLGPTLTTEAP 

AAAQPGKQGPPGTGRCLQAPGTEPGEQTPEGARELSPLQESSSPGGVKAEEEQRAGAEPG

TRPSLARSDDNDHEVGALGLQQGKSPGAGNPEPEQDCAARAPVRAEAVRRMPPGA^AGSV 

VLDDS PAPPAPFEHRWSVKETS I SAGYEVCQHEVLGGGRFGQVHRCTEKSTGLPLAAKI 

IKVKSAKDREDVKNEINIMNQLSHVNLIQLYDAFESKHSCTLVMEYVDGGELFDRITO 

YHLTELDVVLFTRQICEGVHYLHQHYILHI^LKPENILCVNQTGHQIKIIDFGUUUl^ 

REKLKVNFGTPEFLAPEVVNYEFVSFPTDMWSVGVITYMLLSGLSPFLGETDAET^ 

NCSWDFDADTFEGLSEEAKDFVSRIJ^VKEKSCRMSATQCLKHEWLNNLPAKASRSKra 

SQLLLQKYIAQRKWKKHFYWTAANRLRKFPTSP 

SEQ ID NO: 162_SGK088_H 

GEMALFECLVAGPTDVEVDWLCRGRLLQPALLKCKMHFDGRKCKLLLTSVHEDDSGVYTC 

KLSTAKDELTCSARLTWPSIAPLFTRLLEDVEVLEGRAARFDCKISGTPPPVVTW^ 

CPMEESENLRLRQDGGLHSLHIAHVGSEDEGLYAVSAVNTHGQAHCSAQLYVEEPRTAAS 

GPSSKLEKMPSIPEEPEQGELERLSIPDFLRPLQDLEVGLAKEAMLECQVTGLPYPTISW 

FHNGHRIQSSDDRRMTQYRDVHRLVFPAVGPQHAGVYKSVIANKLGKAACYAHLYV^ 

PGPPDGAPQVVAVTGRMVTLTWNPPRSLDMAIDPDSLTYTVQHQVLGSDQWTALVTGLRE 

PGWAATGLRKGVQHIFRVLSTTVKSSSKPSPPSEPVQLLEHGPTLEEAPAMLDKPDIVYV

VEGQPASVTVTFNHVEAQVVWRSCRGALLEARAGVYELSQPDDDQYCLRI CRVS RRDMGA 

LTCTARNRHGTQTCSVTLELAEAPRFES IMEDVEVGAGETARFAVWEGKPL PD I MWYKD 

EVLLTESSHVSFVYEENECSLVVTiSTGAQDGGVYTCTAQNIiAGEVSCKAELAVHSAQTA^ 

EVEGVGEDEDHRGRRLSDFYDIHQEIGRGAFSYLRRIVERSSGLEFAAKFIPSQAKPKAS 

ARREARLLARLQHDCVLYFHEAFERRRGLVIVTELCTEELLERIARK1>TVCESEIRAY^ 

QVLEGIHYLHQSHVLHLDVKPENLLVWDGAAGEQQWICDFGNAQELTPGEPQYCQ^ 

EFVAPEIVNQSPVSGVTDIWPVGWAFLCLTGISPFVGENDRTTLMNIRNYNVAFEETTF 

LSLSREARGFLIKVLVQDRLRPTAEETLEHPWFKTQAKGAEVSTDHLKLFLSRRRWQRSQ 

ISYKCHLVLRPIPELLRAPPERVWVTMPRRPPPSGGLSSSSDSEEEELEELPSVPRPLQP 

EFSGSRVSLTDIPTEDEALGTPETGAATPMDWQEQGRAPSQDQEAPSPEALPSPGQEPAA 

GASPRRGELRRGSSAESALPRAGPRELGRGLHKAASVELPQRRSPGPGATRLARGGLGEG 

EYAQRLQALRQRLLRGGPEDGKVSGLiRGPLLESLGGRARDPRMARAASSEAAPHHQPPLE 

NRGLQKSSSFSQGEAEPRGRHRRAGAPLEIPVARLGARRLQESPSLSALSEAQPSSPARP 

SAPKPSTPKSAEPSATTPSDAPQPPAPQPAQDKAPEPRPEPVRASKPAPPPQALQTLALP 

LTPYAQIIQSLQLSGHAQGPSQGPAAPPSEPKPHAAVFARVASPPPGAPEKRVPSAGGPP 

VLAEKARVPTVPPRPGSSLSSSIENLESEAVFEAKFKRSRESPLSLGLRLLSRSRSEERG 

PFRGAEEEDGIYRPSPAGTPLELVRRPERSRSVQDLRAVGEPGLVRRLSLSLSQRLRRTP 

PAQRHPAWEARGGDGESSEGGSSARGSPVIiAMRRRLSFTLERLSSRLQRSGSSEDSGG^^ 




WO 00/73469

PCT/US00/14842

FIGURE 1K

GRSTPLFGRLRRATSEGESLRRLGLPHNQLAAQAGATTPSAESLGSE^ 

RSRLRWGFSRPRKDKGLSPPNLSASVQEELGHQYVRSESDFPPVFHIKLKDQVLLEGEAA 

TLLCLPAACPAPHISWMKDKKSLRSEPSVIIVSCKDGRQLLSIPRAGKRHAGLYECSATN 

VLGSITSSCTVAVARVPGKIAPPEVTQTYQDTALVLWKPGDSRAPCTYTLERRVDGESVW 

HPVS SGI PDCT YlJm^LPVGVTVRFRVACT^NRAGQGPFSNSSEKVFWGTQDSSAVPSAA 

HQEAPVTSRPARARPPDSPTSLAPPLAPAAPTPPSVTVSPSSPPTPPSQALSSLKAVGPP 

PQTPPRRHRGLQAARPAEPTLPSTHVTPSEPKPFVLDTGTPIPASTPQGVKPVSSSTPVY 

VVTSFVSAPPAPEPPAPEPPPEPTKVTVQSLSPAKEWSSPGSSPRSSPRPEGTTLRQGP 

PQKPYTFLEEKARGRFGWRACRENATGRTFVAKIVPYAAEGKPRVLQEYEVLRTLHHER 

IMSLHEAYITPRYLVLIAESCGNRELLCGLSDRFRYSEDDVATY^4VQLLQGIiDYLHGHHV 

LHLDIKPDNLLLAPDNALKIVDFGSAQPYNPQALRPLGHRTGTLEFMAPEMVKGEPIGSA 

TDIWGAGVLTYIMLSGRSPFYEPDPQETEARIVGGRFDAFQLYPNTSQSATLFLRKVLSV 

HPWSRPSLQDCLAHPWLQDAYLMKLRRQTLTFTTNRLKEFLGEQRRRRAEAATRH 

SYPGGP 

SEQ ID NO: 163_AA542015_M SGK088_M

ATDIWGAGVLTYIMLSGYSPFYEPDPQETEARIVGGRFDAFQLYPNTSQSATLFLRKVLS 
VHPWSRPSLQDCLAHPWIXJDAYLMKLRRQTLTFTTin^KEFLGEQRRRRAEAATRHK^ * 

RSYPGSP 

SEQ ID NO: 164_R19772_H 

MKGGDRAYTRGPSLGWLFAKCCCCFPCRDAYSHS S SENGGKSES VANLQAQPSLNF I HS S 
PGPKRSTNTLKKWLTSPVRRLNSGKADGNIKKQKJC\ny^ 

ADESKKGWGEDEPDEESHTPLPPPMKIFDNDPTQDEMSSSLLiAARQASTEVPTAADLVNA 

lEKLVKNKLSLEGSSYRGSLKDPAGCI^GMAPPTPPKNPEEEQKAKALRGRMFVIiNE 

QTEKDYVKDLGIVVEGFMKRIEEKGVPEDMRGKDKIVFGNIHQIYDWHKDFFIAELEKCI 

QEQDRIiAQLFIKHERKLHIYVWYCQNKPRSEYIVAEYDAYFEEVKQEINQRLTLSDFLIK 

PIQRITKYQLLLKDFLRYSEKAGLECSDIEKAVELMCLVPKROOTMMNIiGR^ 

AQGKLLQQDTFYVIELDAGMQSRTKERRVFLFEQIVIFSELLRKGSLTPGYMFKRSIKNN 

YLVLEENVDNDPCKFAIiMNRETSERVVLQAANADICKJAWVQDINQVLETQR^ 

lEYQRKERSTAVMRSQPARLPQASPRPYSSVPAGSEKPPKGSSYNPPLPPLKISTSNGSP 

GFEYHQPGDKFE7VSKNDLGGCNGTSSMAVIKDYYALKENEICVSQGEWQVIAVNQQNM 

LVYQPASDHSPAAEGWVPGSILAPLTKATAAESSDGSIKKSCSWHTLRMRKRAEVENTGK 

NEATGPRKPKDILGNKVSVKETNSSEESECDDLDPNTSMEILNPNFIQEVAPEFLVPLVD 

VTCLLGDWILQCKVCGRPKPTITWKGPDQNILDTDNSSATYTVSSCDSGEITLKICNLM 

PQDSGIYTCIATNDHGTTSTSATVKVQGVPAAPNRPIAQERSCTSVILRWLPPSSTGNCT 

ISGYTVEYREEGSQIWQQSVASTLDTYLVIEDLSPGCPYQFRVSASNPWGISLPSEPSEF 

VRLPEYDAAADGATISWKENFDSAYTELNEIGRGRFSIVKKCIHKATRKDVAVKFVNK^ 

KKKEQAAHEAALLQHLQHPQYITLHDTYESPTSYILILELMDDGRLL^ 

VAFYIRDIMEALQYLHNCRVAHLDIKPENLLIDLRIPVPRVKLIDLEDAVQISGH^^ 

LI/3NPEFAAPEVIQGIFVSLGTDIWSIGVLTYVMLSGVSPFLDESKEE^^ 

PHEYFCGVSNAARDFINVILQEDFRRRPTAATCLQHPWLQPHNGSYSKIPLDTSRLACFI 

ERRKHQNDVRP I PNVKS YI VNRVNQGT 

SEQ ID NO: 165_5R72_8_2_H

MADSGIJDKKSTKCPDCSSASQKDVLCVCSSKTRVPPVLVVEMSQTSSIGSAESLISLERK 
KEKN INRD I TS RKDLPSRTSNVERKASQQQWGRGNFTEGKVPH I R I ENGAAI EE I YTFGR 
ILGKGSFGIVIEATDKETETKWAIKKVNKEKAGSSAVKLLEREVNILKSVKHEHIIHLEQ 
WETPKKMYLVMELCEDGELKEILDRKGHFSENETRWIIQSLASAIAYIJINOT 
LENIMVKSSLIDDNNEINLNIKNTTDFGIAVKKQSRSEAMLQATCGTPI.^^ 
FIGURE 1L

SQQCDIWSIGVV^^mLLRGEPPFIASSEAKLFELIRKGELHFENAVVmSISDCAKSVLKQ 
LMKVDPAHRITAKELIJ:)NQWLTGNKLSSWPTNVLEiy^ 

TEEKLKSYQPWGNVPEmrrSDEEEEKQSTAYEKQFPATSKDNFDMCSSSFTSSKLLPAE 
I KGEMEKTPVTPSQGTATKY PAKSGALSRTKKKL 

SEQ ID NO: 166_SGK309_H 

MQCLAAALKDETNMSGGGEQADILPANYWKDRWKVLKKIGGGGFGEIYE 

ALKVESAQQPKQVLKMEVAVLKKLQGSGLGQGIX3KEEMMKPGAKRGKDHV 

KFNYV^QLQGRNLADLRRSQPRGTFTLSTTLRLGKQ I LES I EAIHS VGFLHRDI KPSNF 

AMGRLPSTYRKCYMLDFGLARQYTtnTGDVRPPRNVAGFRGTVRYASV^^ 

DLWSLFYMLVEFAVGQLPWRKIKDKEQVGMIKEKYEHRMLLKHMPSEFHLFLDHIASLDY 

FTKPDYQLIMSVFENSMKERGIAENEAFDWEKAGTDALLSTSTSTPPPAEHPADGSHVWG 

GQCDASAWGPAPGEHRGCATGRAPEXPGECTPNSAREALXGAGPQSPPCPPPRGSXGXSL 

GGDRCQPEQTPDQHRQSNCRQGEGRGWPFLSPPIPSLVPLPCSSXAPCPPPISLLARPLF 

PVPSPALASLCLPSSSSSVSFTLRRPSA 

SEQ ID NO: 167_AA234451_H

MSGGGEQIJDILSVGILVKERWK\njRKIGGGGFGEIYDALDMLTRENVALKVESAQQPKQV 
LKMEVAVLKKLQGKDHVCRFIGCGRNDRFNYVVMQLQGRNLADLRRS 
LGRQILESIESIHSVGSXHRDIKPSNFAMGRFPSTCRKCYMLDFGLARQFTNSCGDVRPP 
RAVAGFRGTVRYAS INAHRNREMGRHDDLWS LFYMLVEF WGQLPWRKI KDKEQVGS I KE 
RYDHRLMLKHLPPEFSIFLDHISSLDYFTKPDYQLLTSVFDNSIKTFGVIESDPFDWEKT 
GNDGSLTTTTTSTTPQLHTRLTPAAIGIANATPI PGDLLRENTDEVFPDEQLSDGENGI P 
VGVSPDKLPGSLGHPRPQEKDVWEEMDANKNKIKLGICKAATEEENSHGQANGLLNAPSL 
GSPIRWSEITQPDRDIPLVRKLRSIHSFELEKRLTLEPKPDTDKFLETWYKIVYFSF 

SEQ ID NO: 168_AA435956_H 

TFTIFFEMTVFDLEAKSARGGSNLLI^SVSSFQLFMFQLLRGLAYIHHQHVIjHRDLKPQN 
LLISHLGELKLADFGIiARAKSIPSQTYSSEVVTLVryRPPDAIJJ^ 

FIEMFQGQPLFPGVSNILEQLEKIWEVLGVPTEDTWPGVSKLPNYNPEWFPLPTPRSLHV 
VWNRIiGRVPEAEDLASQMLKGFPRDRVSAQEALVHDYFSALPSQLYQLPDEESLFTVSGV 
RLKPEMCDLLASYQKGHHPAQFSKCW 

SEQ ID NO: 169_AA626859_H

NGVADGVI KS VLWQTLQALNFCHI HNCIHRDI KPENI LI TKQGII KI CDFGFAQI LI PGD 
AYTDYVATRWYRAPELLVGDTQYGSSVDIWAIGCVFAELLTGQPLWPGKSDVDQLYLIIR 
TLGKL I PRHQS I FKSNGFFHGI S I PEPEDMETLEEKFSDVHPVALNFMKGCLKMNPDDRL 
TCSQLLESSYFDSFQEAQIKRKARNEGRNRRRQQNQLLPLIPGSHISPTPDGRKQVLQLK 

FDHLPNI 

SEQ ID NO: 170_AA061797_M 

KI ALRE IRMLKLKHPNLVNLI EVFRRIOmiHLVFEYCDHTLLNELERNPNGVSDGVI KSV 
LWQTLQALNFCHKHNCIHRDVKPENILI TKQGMI KI CDFGFARI LI PGDAYTDYVATRWY 
RAPELLVGDTKYGSSVDVWAVGCVFAELLTGQPLWPGKSDVDQLYLI IRTLGKLI PRHQS 
I FRSNQFFRGI S I PEPEDMETLEEKFSNVQPVALSFMKGCLKMNPDERLTCAQLLDS AYF 
ESFQEDQMKRKARSEGRSRRRQQNQLLPLI PGSHI SPTPDGRKQWQLKFDHLPNI 

SEQ ID NO: 171_AA397553_H

MPNSERHGGKKDGSGGASGTLQPSSGGGSSNSRERHRLVSKHKRHKSKHSKDMGLVTPEA 
ASLGTVIKPLVEYDDISSDSDTFSDDMAFKLDRRENDERRGSDRSDRLHKHRHHQHRRSR 
FIGURE 1M



DLLKAKQTEKEKSQEVSSKSGSMKDRISGSSKRSNEETDDYGKAQVAKSSSKESRSSKLH 

KEKTRKERELKSGHKDRSKSHRKRETPKSYKTVDSPKRRSRSPHRKWSDSSKQDDSPSGA 

SYGQDYDLSPSRSHTSSNYDSYKKSPGSTSRRQSVSPPYKEPSAYQSSTRSPSPYSRRQR 

SVSPYSRRRSSSYERSGSYSGRSPSPYGRRRSSSPFLSKRSLSRSPLPSRKSMKSRSRSP 

AYSRHSSSHSKKKRSSSRSRHSSISPVRLPLNSSLGAELSRKKKERAAAAAAAKMDGKES 

KGS PVFLPRKENS SVEAKDSGLESKKLPRS VKLEKSAPDTELVNVTHLNTE V 

VKLDENSEKHLVKDLKAQGTRDSKPIALKEEIVTPKETETSEKETPPPLPTIASPPPPLP 

TTTPPPQTPPLPPLPPIPALPQQPPLPPSQPAFSQVPASSTSTLPPSTHSKTSAVSSQAN 

SQPPVQVSVKTQVSVTAAIPHLKTSTLPPLPLPPLLPGGDDMDSPKETLPSKPVKKEKEQ 

RTRHLLTDLPLPPELPGGDLSPPDSPEPKAITPPQQPYKKRPKICCPRYGERRQTESDWG 

KRCVDKFDI IGI IGEGTYGQVYKARDKDTGELVALKKVRLDNEKEGFP I TAIREI KI LRQ 

LIHRSVVNMKEIVTDKQDALDFKMDKGAFYLVFEYMDHDLMGL^ 

KQLMEGLEYCHKKNFLHRDIKCSNIIiIim^SGQIK^^ 

YRPPELIiLGEERYTPAIDVWSCGCIIiGELFTKKPIFQANLELAQLELISRLCGSPCPA 

PDVIKLPYFNTMKPKKQYRRRLREEFSFIPSAALDIiliDHMLTLDPSKRCTAEQTLQSDFL 

KDVELSKMAPPDLPHWQDCHELWSKKRRRQRQSGVVVEEPPPSKTSRKETTSGTSTEP^ 

NSSPAPPQPAPGKVESGAGDAIGI^ITQQLNQSEIiAVLLl^I^SQTDL I PQMAQLLNI 

HSNPEMQQQLEALNQS I SALTEATSQQQDSETMAPEESLKEAPS APVI LPSAEQMTLEAS 

STPADMQNIIiAVLLSQIiMKTQEPAGSIiEENNSDKlSrSGPQGPRRTPTMPQEEA 

PPEKRPPEPPGPPPPPPPPPLVEGDLSSAPQELNPAVTAALLQLLSQPEAEPPGHLPHEH 

QAIiRPMEYSTRPRPNRTYGNTDGPETGFSAIDTDERNSGPALTESLVQTLVKNRTFSGSL 

SHLGESSSYQGTGSVQFPGDQDLRFARVPLALHPWGQPFLKAEGSSNSVVHAETKIiQ^ 

GELGPGTTGASSSGAGLHWGGPTQSSAYGKLYRGPTRVPPRGGRGRGVPY 

SEQ ID NO: 172_AA789239_H 

MEMYETLGKVGEGSYGTVMKCKHKNTGQIVAIKIFYERPEQSVNKIAMREIKFLKQFHHE 
NLVNLIEVFRQKJCKIHLVFEFIDHTVLDELQHYCHGLESKRI^^ 

VIIHRDIKPENILVSQSGITKLCDFGFARTLAAPGDIYTDYVATRWYRAPELVLKDTSYG 

KYVPVDIWALGCMIIEMATGNPYLPSSSDLDLLHKIVLKVXFMPELKAKLLQEAKVN 

KPKESSKENELRKDERKTVYTNTLLSSSVLGKEIEKEKKPKEIKVRVIK^ 

KKKEYEGGLGQQDANENVHPMSPDTKLVTIEPPNPINPSTNCNGLKENPHCGGSV™ 

NLTNSNimANLSSNLFHPSWLTERAKKRRTSSQSIGQVMPNSRQEDPGPIQSQ 

FNERTGHSDQMANENKRKLNFSRSDRKEFHFPELPVTIQSKDTKGMEVKQIKMLKRESK^ 

TESSKIPTLLNVDQNQEKQEFIPLSLLSACCPIFTNICSQLTIRVEMAIARGRI 

SEQ ID NO: 173_AA124976_M 

LADIVHACLQIDPAERTSSTDLLRHDYFTRDGFIEKFIPELRAKLLQEAKVNSFI^ 

FKENEPVRDEKKSWTNTLLYGNPSLYGKEVDRDKRAKELKVRVIKAKGGKGDVP 

EYEGDHRQQGTADDTQPSSLDKKPSVLELTNPLNPSENSDGVKEDPHAGGCMIMPPINLT 

SSNLLAANLSSNLSHPNSRLTERTKKRRTSSQTIGQTLSNSRQEDTGPTQVQTEKGAFNE 

RTGQNDQI SSGNKRKLNFPKCDRKEFHFPELPFTVQAKEMKGMEVKQI KVLKRESKKTDS 

SKIPTLLSMDPNQEKQEGGDGDCEGKNLKRNRFFFSR 

SEQ ID NO: 174_AA575635_M CCRK_M 

SASGQLKIADFGLARVFSPDGGRLYTHQVATRWYRAPELLYGARQYDQGVDLWAVGCIMG 
ELLNGSPLFPGENDIEQLCCVLRILGTPSPRVWPEITELPDYNKISFEEQAPVPLEEVLP 
DASPQALDLLGQFLLYPPRQRIAASQALLHQYFFTAPLPAHPSELPIPQRPGGPAPKAHP 
GPPHVHDFHVDRP I EESLLNPELI RPF I PEG 




SEQ ID NO: 175_AA631990_H

MITSISTEKSGHTHYPFMITTLQYYRGRGGKTAVWRHFSAEGPFAFAEMRHSKRTHCPDW 

DSRESWGHESYRGSHKRKRRSHSSTQENRHCKPHHQFKESDCHYLEARSLNERDYRDI^ 

VDEYRNDYCEGYVPRHYHRDIESGYRIHCSKSSVRSRRSSPKRKRNRHCSSHQSRSXEIV 

DTLGEGAFGKVVECIDHGMDGMHVAVKIVKNVGRYREAARSEIQVLEHL^ 

VQMLEWFDHHGHVCIVFELLGLSTYDFIKENSFLPFQIDHIRQMAYQICQSINFLHHNKL 

THTDLKPENILFVKSDYVVKYNSKMKRDERTLKNTDIK\AnDFGSAT^ 

YRAPEVILALGWSQPCDVWS I GC I LI EYYLGFTVFQTHDSKEHLAMMERI LGP I PQHMI Q 

KTRKRKYFHHNQLDWDEHSSAGRYVRRRCKPLKEFMLCHDEEHEKLFDLVRRMLEYDPTQ 

RITLDEALQHPPFDLLKKK 

SEQ ID NO: 176_AA557536_H 

MCTWDPRIVRRYLLRRQLGQGRTFREITLLQVSGLGPPVQSPCPGTDLSRQERNWPSWA 
PEHSPSWPSSRLRLSPQEFGDHPNIISLLDVIRAENDRDIYLVFEFMDTDLNAVIRKGGL 
LQDVHVRSIFYQLLRATRFLHSGHVVHRDQKPSNVLLDANCTVKLCDFGLARSLGDLPEG 
PEDQAVTEYVATRWYRAPEVLLSSHRYTASCPRYTLGVDMWSLGCILGEMLRGRPLFPGT 
STLHQLELILETIPPPSEEXRPRQTLDALLPPDTSPEALDLLRRLLVFAPDKRLSATQAIi 
QHPYVQRFHCPSDEWAREADVRPRAHEGVQLSVPEYRSRVYQMILECGGSSGTSREKGPE 
GVSPSQAHIiHKPRADPQLPSRTPVQGPRPRPQSSPGHDPAEHESPRAAKNVPRQNSAPLL 
QTALLGNGERPPGAKEAPPLTLSLVKPSGRGAAPSLTSQAAAQVANQT^IRGDWNRGGGV 
RVASVQQVPPRLPPEARPGRRMFSTSALQGAQGGARALLGGYSQAYGTVCHSALGHLPLL 
EGHHV 

SEQ ID NO: 177_N28606_H, MOK_H 

MKNYKAIGKI GEGTFSEVMKMQSLRDGNYYACKQMKQRFES I EQVNNLRE I QALRRLNPH 

PNILMLHEWFDRKSGSLALICELMDMNIYELIRGRRYPLSEKKIMHYM^ 

RNGIFHRDVKPENILIKQDVLKLGDFGSCRSVYSKQPYTEYISTRWYRAPECLLTDGFYT 

YKMDLWSAGCVFYEIASLQPLFPGVNELDQISKIHDVIGTPAQKILTKFKQSRAMNFDFP 

FKKGSGIPIiLTTNLSPQCLSLLHAMVAYDPDERIAAHQALQHPYFQEQRKTEKRALGSHR 

KAGFPEHPVAPEPLSNSCQISKEGRKQKQSLKQEEDRPKRRGPAYVMELPKLKLSGWRL 

SSYSSPTLQSVLGSGTNGRVPVLRPLKCIPASKKTDPQKDLKPAPQQCRLPTIVRKGGR 

SEQ ID NO: 178_AB023153_H, ICK_H

MNRYTTIRQLGDGTYGSVLLGRSIESGELIAIKKMKRKPYSWEECMNQREVKSLKK^ 

NVVKLKEVIRENDHLYFIFEYMKENLYQLIKERNKLFPESAIRNI^4YQILQGIi;^ 

FFHRDLKPENLLCMGPELVKIADFGLAREIRSKPPYTDYVSTRWYRAPEVLLRSTNYSSP 

IDVWAVGCIMAEVYTLRPLFPGASEIDTIFKICQVLGTPKKTDWPEGYQLSSAMNFRWPQ 

CVPNNLKTLIPNASSEAVQLLRDMLQWDPKKRPTASQALRYPYFQVGHPLGSTTQNLQDS 

EKPQKGILERAGPPPYIKPVPPAQPPAKPHTRISSRQHQASQPPIiHLTYPYKAEVSRTDH 

PSHLQEDKPSPLLFPSLHNKHPQSKITAGLEHKNGEIKPKSRRRWGLISRSTKDSDDWAD 

LDDLDFSPSLSRIDLKNKKRQSDDTLCRFESVLDLKPSBPVGTGNSAPTQTSYQRRDTPT 

LRSAAKQHYLKHSRYLPGISIRNGILSNPGKEFIPPNPWSSSGLSGKSSGTMSVISKVNS 

VGSSSTSSSGLTGNYVPSFLKKEIGSAMQRVHliAPIPDPSPGYSSLKAMRPHPGRPFLDT 

QPRSTPGLIPRPPAAQPVHGRTDWASKYPSRR 

SEQ ID NO: 179_AA839940_M 

SSNNGGMSAEEEIGPGAEPMRGPSLATRDWRDETVGTTDLQQGIDPGAVSPEPGKDHAAQ 
GPGRTEAGRVS SAAEAAI WLDDSAAPPAPFEHRWS I KDTL I SAGYTVSQHE VLGGGRF 
GQVHRCTERS TGLALAAKI I KVKNVKDREDVKNEVNIMNQL^ 

LIMEYVDGGELFDRITDEKYHLTELDVVLFTRQICEGVHYLHQHYILHLDLKPENILC^^ 
QTGHQIKIIDFGLARRYKPREKLKVNFGTPEFI^PEVVNYEF^SFPTDMWSVGVIT^^

SGLSPFLGETDAETMNFIWCSWDFD7VDTFKGLSEEAKDFVSRLLVKEKSCRMSATQCLK 

HEWLNHLPAKASGSin/RLRSQQLLQKYMAQSKWKKHFHWAA 

SEQ ID NO: 180_AA460132_H 

MAAARATTPADGEEPAPEAEAIiAAARERSSRFLSGLELVKQGAEARVFRGRFQGRAAVIK 
HRFPKGYRHPALEARLGRRRTVQEARALIiRCRRAGI S APWF FVDYASNCLYMEE I EGSV 
TVRDYIQSTMETEKTPQGLSNLAKTIGQVLARMHDEDLIHGDL 

LIDFGLSFI SALPEDKGVDLYVLEKAFLSTHPNTETVFEAFLKSYSTS S KKARPVLKKLD 
EVRLRGRKRSMVG 

SEQ ID NO: 181_SGK034_H 

QREKWQGNMPGIiQSTFLAMDTEEGVEVVWNELxHFGDRKAFAAHEEKI^ 

PNIVKLHKYWLDTSEACARVIFITEYVSSGSLKQFLKKTKKNHKAh^ 

ALSFIiHACSPPIIHGNLTSDTIFIQHNGLIKIGSVWHRIFSNALPDDLRSPIRAEREELR 

NLHFFPPEYGEVADGTAVDIFSFGMCMiEMAVLEIQTNGDTRVTEEAIARARHSLSDPN^ 

REFILCCIARDPARRPSAHSLLFHRVLFEVHSLKLLAAHCFIQHQYLMPENV\^ 

DLHAVLAELPRPRRPPLQWRYSEVSFMELDKFLEDVRNGIYPLMNFAATR 

PPEEVQKAKTPTPEPFDSETRKVIQMQCNLERSEDKARWHLTLLLVLEDRLHRQLTYDLL 

PTDSAQDLASELVHYGFLHEDDRMKLAAFLESTFLKYRGTQA 

SEQ ID NO: 182_AA103218_M SGK034_M

HASAPEYGEVNDGTGFVDIFSFGMCT^EMAVLEIQANGDTRVTEEAIARARHSLSDPNI^ 

EFILSCIiARDPARRPSAHNLLFHRVLFEVHSLKLIJ^CFIQHQYLMPENVVEEKTK^^ 

LHAVLAEMPQPHGPPMQWRYSEVSFLELDKFLEDVRNGIYPLMNFAAARPLGLPRV^ 

PEEAQKAKTPTPEPFDSETRKWQMQCNLERSEDKARWHLTLLLVLEDRLHRQLTYDLLP 

TDSAQDLAAELVHYGFLHEDDRTKLAAFLETTFLKYRGTQA 

SEQ ID NO: 183_NEK7_H, N34132_H

MSGGAAEKQSSTPGSLFLSPPAPAPKNGSSSDSSVGEKLGAAAADAVTGRTEEYIWiRRHT 
MDKDSRGAAATTTTTEHRFFRRSVI CDSNATALELPGLPLSLPQPS I PAAVPQSAPPEPH 
REETVTATATSQVAQQPPAAAAPGEQAVAGPAPSTVPSSTSKDRPVSQPSLVGSKEEPPP 
ARSGSGGGSAKEPQEERSQQQDDIEELETKAVGMSNDGRFLKFDIEIGRGSFKTVYKGLD 
TETTVEVAWCELQDRKLTKSERQRFKEEAEMLKGLQHPNIWFYDSWESTVKGKKCIVLV 
TELMTSGTLKTYLKRFKVMKIKVLRSWCRQILKGLQFLHTRTPLI IHRDLKCDNI FITGP 
TGSVKIGDLGLATLKRASFAKSVIGTPEFMAPEMYEEKYDESVDVYAFGMCMLEMATSEY 
PYSECQNAAQIYRRWSGVKPASFDK^IPEVKEIIEGCIRQNKDERYSIKDLLNHAFFQ 
EETGVRVELAEEDDGEKI AI KLWLRI EDI KKLKGKYKDNEAI EFCFDLERDVPEDVAQEM 
VESGYVCEGDHKTMAKAIKDRVSLIKRKREQRQLVREEQENKKQEESSLKQQVEQSSASQ 
TGIKQLPSASTGIPTASTTSASVSTQVEPEEPEADQHQQLQYQQPSISVLSDGTVDSGQG 
SSVFTESRVSSQQTVSYGFPXHEQAHSTGTVPGHIPSTVQAQSQPHGVYPPSSVQQGIQQ 
TAPPQQTVQYSLSQTSTSSEATTAQPVSQPQAPQVIiPQVSAGKQSTQGVSQVAPAEPVAV 
AQPQATQPTTLAS S VDS AHSDVASGMSDGNENVPS S SiSRHEGRTTKRHYRKSVRSRSRHE 
KTSRPKLRILNVSNKGDRVVECQLETHNRKMVTFKFDL^ 

RESFVDQVREI lEKADEMLSEDVSVEPEGDQGLESLQGKDDYGFSGSQKLEGEFKQPI PA 
SSMPQQIGIPTSSLTQVVHSAGRRFIVSPVPESRLRESK\^PSEITDTVAASTAQSPGMN 
LSHSASSLSLQQAFSELRRAQMTEGPNTAPPNFSHTGPTFPWPPFLSSIAGVPTTAAAT 
APVPATSSPPNDISTSVIQSEVTVPTEEGIAGVATSTGWTSGGLPIPPVSESPVLSSW 
SSITIPAWSISTTSPSLQVPTSTSEIWSSTALYPSVTVSATSASAGGSTATPGPKPPA 
WSQQAAGSTTVGATLTSVSTTTSFPSTASQLSIQLSSSTSTPTLAETVVVSAHSLDKTS 
HSSTTGIAFSLSAPSSSSSPGAGVSSYISQPGGLHPLVIPSVIASTPILPQAAGPTSTPL
LPQVPSIPPLVQPVANVPAVQQTLIHSQPQPALLPNQPHTHCPEVDSDTQPKAPGIDDIK 
TLEEKLRSLFSEHSSSGAQHASVSLETSLVIESTVTPGIPTTAVAPSKLLTSTTSTCLPP 
TNLPLGTVALPVTPWTPGQVSTPVSTTTSGVKPGTAPSKPPLTKAPVLPVGTELPAGTL 
PSEQLPPFPGPSLTQSQQPLEDLDAQLRRTLSPEMITVTSAVGPVSMAAPTAITEAGTQP 
QKGVSQVKEGPVIiATSSGAGVFKMGRFQVSVAADGAQKEGKNKSEDAKSVHFESSTSESS 
VLSSSSPESTLVKPEPNGITIPGISSDVPESAHKTTASEAKSDTGQPTKVGRFQVTTTAN 
KVGRFSVSKTEDKITDTKKEGPVASPPFlVaDLEQAVLPAVIPKKEKPELSEPSHLNGPSSD 
PEAAFLSRDVDDGSGSPHSPHQLSSKSLPSQNLSQSLSNSFNSSYMSSDNESDIEDEDLK 
LELRRLRDKHLKEIQDLQSRQKHEIESLYTKLGKVPPAVIIPPAAPLSGRRRRPTKSKGS 
KSSRSSSLGNKSPQLSGNLSGQSAASVLHPQQTLHPPGNIPESGQNQLLQPLKPSPSSDN 
LYSAFTSDGAISVPSLSAPGQGNKATIIVQKQ

SEQ ID NO: 184_BCON3_H 

MSEGESQTVLSSGSDPKVESSSSAPGLTSVSPPVTSTTSAASPEEEEESEDESEILEESP 

CGRWQKRREEVNQRNVPGIDSAYLAl^TEEGVEVVWNEVQFSERKNY^ 

LIQLEHLNIVKFHKYWADIKENKARVIFITEYMSSGSLKQFLKKTKIO!^

CTQILSALSYLHSCDPPIIHGNLTCDTIFIQHNGLIKIGSVAPDTINNHVKTCREEQK^

HFFAPEYGEVTNVTTAVDIYSFGMCALEMAVLEIQGNGESSYVPQEAISSAIQLLEDPLQ 

REFIQKCLQSEPARRPTARELLFHPALFEVPSLKLLAAHCIVGHQHMIPENALEEITKNM 

DTSAVLAEIPAGPGREPVQTLYSQSPALELDKFLEDVRNGIYPLTAFGLPRPQQPQQEEV 

TSPVVPPSVKTPTPEPAEVETRKVVLMQCNIESVEEGVKHHLTLLLKLEDKI^^ 

MPNENIPELAAELVQLGFISEADQSRLTSLLEETIJ^KFNFARNSTLNSAAV^

SEQ ID NO: 185_AA711829_M 

LKQFLKKTKKNHKTMNEKAWKRWCTQILSALSYLHSCDPPIIHGNLTC^^ 

IGSVAPDTINNHVKTCREEQKNLHFFAPEYGEVTNVTTAVDIYSFGMCAI^^ 

GESSYVPQEAISSAIQLLEDSLQREFIQKCLQSEPARRPTARELLFHPALFEVPSLKLLA 

AHCIVGHQHMIPENALEEITKNMDTSAVLAEIPAGPGREPVQTLYSQSPAIiELDKFLEDV 

RNGIYPLTAFGLPRPQQPQQEEVTSPWPPSVKTPTPEPAEVEraKVVLMQCNIESVEEG 

VKHHLTLLLKLEDKLNRHLSCDLMPNESIPDLAAELVQLGFISEADQS 

FNFTRNSTLNTATVTVSS 

SEQ ID NO: 186_AA099102_H 

MSSCVSSQPSSNRAAPQDELGGRGSSSSESQKPCEALRGLSSLSIHLGMESFIWTECEP 

GCAVDLGLARDRPLEADGQEVPLDTSGSQARPHLSGRKLSLQERSQGGIiAAGGSLDMNGR 

CICPSLPYSPVSSPQSSPRLPRRPTVESHHVSITGMQDCVQLNQYTLKDEIGKGSYGWK 

lAYNENDNTYYAMKVT^SKKKLIRQAAFPRRPPPRGTRPAPGGCIQPRGPIEQVYQEIAIL 

KKLDHPNVVKLVEVLDDPNEDHLYMVFELVNQGPVMEVPTLKPLSEDQAR 

EYLHYQKIIHRDIKPSNLLVGEDGHIKIADFGVSNEFKGSDAIiLSNYVGTPAFMAPESLS 

ETRKIFSGKAIODWAMGVTLYCFVFGQCPFMDERIMCLHSKIKSQALEFPDQPDIAEDLK 

DLITRMLDKNPESRIWPEIKLHPWVTRHGAEPLPSEDENCTLVEVTEEEVENSVKHIPS 

lAWILVKTMIRKRSFGNPFEGSRREERSLSAPGNIJ^TKICPTRECESLSELKEARQ 

PGHRPAPRGGGGSALVRGSPCVESCWAPAPGSPARMHPLRPEEAMEPE 

SEQ ID NO: 187_5R69_17_2_H 

MQEIPQEQIKEIKKEQLSGSPWILLRENEVSTLYKGEYHRAPVAIKVFKKLQAGSIAIVR 
QTFNKEIKTMKKFESPNILRIFGICIDETVTPPQFSIVMEYCELGTLRELLDREKDLTLG 
KRMVLVLGAARGLYRIJ^HSEAPEIiHGKIRSSNFLWQGYQVKL^

REKTDRVKSTAYLSPQELEDVFYQYDVKSEIYSFGIVLWEIATGDIPFQGEECEDWLSQW 
L 

SEQ ID NO: 188_H85811_H 

MAPVYEGMASHVQVFSPmijQSSAFCSVKKLKIEPSSNWDMTGYGSHSKVYSQSKNI 

QPATTTVSTSLPVPNPSLPYEQTIVFPGSTGHIVVTSASSTSVTGQVLGGPHNLMRRSTV 

SLLDTYQKCGLKRKSEEIENTSSVQIIEEHPPMIQNNASGATVATATTSTATSKNSGSNS 

EGDYQLVQHEVLCSMTNTYEVLEFLGRGTFGQVVKCWKRGTNEIVAIKILKNHPSYARQ 

QIEVSILARLSTESADDYNFVRAYECFQHKNHTCLVFEIVDJEQNLYDFLKQNKFSPL

IRPVLQQVATAIiMKLKSLGLIHADLKPENIMLVDPSRQPYRVKVIDFGSASHVSKAVCST 

YLQSRYYRAPEIILGLPFCEAIDMWSIiGCVIAELFLGWPLYPGDSEYDQIRYISQTQGLP 

AEYLLSAGTKTTRFFNRDTDSPYPLWRLKTPDDHEAETGIKSKEARKYIFNCLDDMAQVN

MTTDLEGSDl^VEKADRREFIDLLKKMLTIDADKRITPIETLNHPFV^^ 

VKSCFQNMEICKRRVNKSHDTVNQSKTPFITHVAPSTSTNLTM^

AAVAQRSMPLQTGTAQICARPDPFQQALIVCPPGFQGLQASPSKHAGYSVRMENAVPIVT 
QAPGAQPLQIQPG]^QQAWPSGTQQILLPPAWQQLTGVATHTSVQHATVIPETMAGTQQ 
IJUDVmNTHAHGSHYNPIMQQPALLTGHVTLPAAQPLNVGVAHV^ 

QSSVRlArSTCEVSSSQAISSPQRSKRVKENTPPRCAMVHSSPACSTSVTCGWGDVASST^ 
RERQRQTIVIPDTPSPTVSVITISSDTDEEEEQKHAPTSTVSKQRKNVISCVTVHDSPYS 
DSSSNTSPYSVQQRAGHNNANAFDTKGSLENHCTCaaPRTIIVPPLKTQASEVliVECDSLV 
PVNTSHHSSSYKSKSSSNVTSTSGHSSGSSSGAITYRQQRPGPHFQQQQPLNLSQAQQHI 
TTDRTGSHRRQQAYITPTMAQAPYSFPHNSPSHGTVHPHLAAAAAAAHLPTQPHLYTYTA 
PJVALGSTGTVT^VASQGSARHTVQHTAYPASIVHQVPVSMGPRVLPSPTIHPSQYPAQF 
AHQTYISASPASTVYTGYPLSPAKVNQYPYI

SEQ ID NO: 189_DYRK3_H 

MMIDETKCPPCSNVLCNPSEPPPPRRLNMTAEQFTGDHTQHFIXIGGEMKVEQLFQEFGN 
KSNTIQSDGISDSEKCSPTVSQGKSSDCLNTVKSNSSSKAPKWPLTPEQALKQYKHH 
AYEKLEIINYPEIYFVGPNAKKRHGVIGGPNNGGYDDADGAYIHVPRDHLAYRYEVLKII
GKGSFGQVARVYDHKLRQYVALKMVRNEKRFHRQAAEEIRILEHLKKQDKTGSMNVIH 
ESFTFRNHVCmFELLSIDLYELIKKNKFQGFSVQLVRKFAQSILQSLDALHKNKIIHCD
LKPENILLKHHGRSSTKVIDFGSSCFEYQKLYTYIQSRFYRAPEIILGSRYSTPIDIWSF 
RCILAELLTGQPLFPGEDEGDQIACMMELLGMPPPKLLEQSKRAKYFINSKGIPRYCSV^ 
TQADGRWLVGGRSRRGKKRGPPGSKDWGTALKGCDDYLFIEFLKRCLHWDPSARLTPAQ
ALRHPWISKSVPRPLTTIDKVSGKRVVNPASAFQGIXSSKLPPWGIANKLKANIiMSETNG 

SIPLCSVLPKLIS 

SEQ ID NO: 190_AA589241_M DYRK3_M 

TRPELLGMPPQKLLEQSKRAKYFINSKGLPRYCSVSTQTDGRVVLLGGRSRRGKKRGPPG 
SKDWATALKGCGDYLFIEFLKRCLQWDPSARLTPAQALRHPWISKSTPKPLTMDKVPGKR 
VVNPTNAFQGIiGSKLPPWGIASKIJCANLMSETSGSIPLCSVLPK^ 

SEQ ID NO: 191_5R72_16_2_H

MAGGRGAPGRGRDEPPESYPQRQDHELQALEAIYGADFQDLRPDACGPVKEPPEINLVLY 

PQGLTGEEVYVKVDLRVKCPPTYPDWPEIELKNAKGLSNESWLLKSRLEELAKK^ 

VMIFELAYHVQSFLSEHNKPPPKSFHEEMLERRAQEEQQRLLEAKRKEEQEQREILHEIQ 

RRKEEIKEEKKRKEMAKQERLEIASLSNQDHTSKKDPGGHRTAAILHGGSPDFVGNGKHR 

ANSSGRSRRERQYSVCNSEDSPGSCEILYFNMGSPDQLMVHKGKCIGSDEQLGKLVYNAL 

ETATGGFVLLYEWVUJWQKKMGPFLTSQEKEKIDKCKKQIQGTETEFNSLVKLSHPNVV^ 
YLAMNLKEQDDSIWDILVEHISGVSLAAHLSHSGPIPVHQLRRYTAQLLSGLDYLHSNS

VVHKNTLSASNVLVDAEGTWITDYSISKRIJ^ICKEDVFEQTRVRFSD 

VWRLGLLLLSLSQGQECGEYPVTIPSDLPADFQDFLKKCVCLDDKERWSPQQLLKHSFIN 

PQPKMPLVEQSPEDSGGQDYVETVIPSNRLPSAAFFSETQRQFSRYFIEFEELQLLGKGA 

FGAVIKVQNKLDGCCYAVKRIPINPASRQFRRIKGEVTLLSRLHHENIVRYYNAWIERHE

RPAGPGTPPPDSGPLAKDDRAARGQPASDTDGLDSVEAAAPPPILSSSVEWSTSGERSAS 

ARFPATGPGSSDDEDDDEDEHGGVFSQSFLPASDSESDIIFDNEDENSKSQNQDEDCNEK 

NGCHESEPSVTTEAVHYLYIQMEYCEKSTLRDTIDQGLYRDTVRLWRLFREILDGLAYIH 

EKGMIHRDLKPVNIFLDSDDmnCIGDFGIATDHlAFSADSKQDDQTGDLIKSDPSGHLTG 

^^GTALYVSPEVQGSTKSAYNQKVDLFSLGIIFFEMSYHPMVTASERIFVLNQLRDPTSP

KPPEDFDDGEHAKQKSVISWLLNHDPAKRPTATELLKSELLPPPQMEESE 

NVDGKAYRTMMAQIFSQRISPAIDYTYDSDILKGNFSIRTAKMQQHVCETIIRIFKRHGA 

VQLCTPLLLPRiniQIYEHNEAALFMDHSGMLVMLPFDLRIPFARYYARNNILNLK^

RVFRPRKLDRFHPKELLECAFDIVTSTTNSFLPTAEIIYTIYEIIQEFPALQERNYSIYL 

NHT^!□aLKAILLHCGIPEDKLSQVYIILYDAVTEKLTRREVEAKFCNLSLSSNSLCRL^ 

lEQKGDLQDLMPTINSLIKQKTGIAQLVKYGLKDLEEVVGLLKKLGIKLQVLINLGLVYK 

VQQHNGIIFQFVAFIKRRQRAVPEILAAGGRYDLLIPQFRGPQALGPVPTAIGVSIAIDK

ISAAVI^EESVTISSCDLLWSVGQMSMSRAINLTQKLWTAGITAEIMYDWSQSQEELQ 

EYCRHHEITYVALVSDKEGSHVKVKSFEKERQTEKRVLETELVDHVLQKLRTKVTDE^ 

REASDNLAVQNLKGSFSNASGLFEIHGATVVPIVSVIiAPEKLSASTRRRYETQVQTRLQT 

SliANLHQKSSEIEIIiAVDLPKETILQFLSLEWDADEQAFNTTVKQIJ^SRLPKQRYLK^ 

DEIYNIKVEKKVSVLFLYSYRDDYYRILF 

SEQ ID NO: 192_R43524_H, HRI_H 

MI/5C»JSGVRKREEEGDGAGAVAAPPAIDFPAEGPDPEYDESDVPAEIQVLKEPLQQPTFP 

FAVANQLLLVSIiLEHLSHVHEPNPLRSRQVFKLLCQTFIKMGLLSSFTCSDEFSSLRLHH 

NRAITHLMRSAKERVRQDPCEDISRIQKIRSREVALEAQTSRYLNEFEELVILiGKGGYGR 

WKVRNKLDGQYYAIKKILIKGATKWCMKVLREVKVLAGLQHPNIVGYHTAWIEHVHVI

QPRADRAAIELPSLEVLSDQEEDREQCGVKNDESSSSSIIFAEPTPEKEKRFGESDTENQ 

NNKSVKYTTl^VIRESGELESTLELQENGIAGLSASSIVEQQLPIiRRNSHIi^ 

SSEENVNFLGQTEAQYHLiMLHIQMQLCELSLWDWIVERNKRGREYVDES 

KIFQELVEGVFYIHNMGIVHRDLKPRNIF]^GPDQQVKIGDFGLACTDILQKNTDW 

GKRTPTHTSRVGTCLYASPEQLEGSEYDAKSDMYSLGWLLELFQPFGTEMERAEVLTGL 

RTGQLPESIiRKRCPVQAKYIQHLTRRNSSQRPSAIQLLQSELFQNSGNVNLTLQMKIIE^ 

EKEIAELKKQLNLLSQDKGVRDDGKDGGVG 

SEQ ID NO: 193_17000057519457_H 

MAAARATTPADGEEPAPEAEALAAARERSSRFLSGLELVKQGAEARVFRGRFQGRAAVIK 
HRFPKGYRHPALEARLGRRRTVQEARALLRCRRAGISAPWFFVDYASNCLYMEEIEGSV
TVRDYIQSTMETEKTPQGLSNLAKTIGQVLARMHDEDLIHGDLTTSNMLLKPPLEQI^ 
LIDFGLSFISALPEDKGVDLYVLEKAFLSTHPNTETVFEAFLKSYSTSSKKARPVLKKLD 

EVRLRGRKRSMVG 

SEQ ID NO: 194_AA013524_M 

LVQQGAEARVFRGRFQGRAAWKHRFPKSYRHPELEARLGRRRTVQEARALLRCRRAGIA 
APWFFVDYASNCLYMEEIEDSVTVRDYIQSTMETEKDPQCLLDLARRMGQVLAGM 
LIHGDLTTSNMLLRRPIiAQLHIVLIDFGLSFVSGLPEDKGVDLYVLEKAFLSTHPHTETA 
FEAFLKSYGASSKK5SPVLKKLDEVRLRGRKRSMVG 

SEQ ID NO: 195_17000139801197_H, IRAKM_H

MAGNCGARGALSAHTLLFDLPPALLGELCAVLDSCDGAIX^WRGIAERLSSSWL^

yVDQGKSGTRELLWSWAQKNKTIGDLLQVUJEMGHRRAIHLITNYGAVLSPSEKSYQ 

FPNILFKETANVTVDNVLIPEHNEKGVLLKSSISFQWIlEGTRNFHK^

RVEIQNLTYAVKLFKQEKKMQCKKHWKRFLSELEVLLLFHHPNILELAAYFTETEK^^ 

YPYMRNGTLFDRLQCVGDTAPLPWHIRIGILIGISKAIHYLHNVQPCSVICGSISSANIL

LDDQFQPKLTDFANAHFRSHLEHQSCTINMTSSSSKHLVmiPEEYIRQGKLSIKTDVYSF 

GIVIMEVLTGCRVVLDDPKHIQLRDLLRELMEKRGLDSCLSFLDKKVPPCPR^ 

IAGRCAATRAIaJRPS^^DEVLNTLESTQASLYFAEDPPTSLKSFRCPSPLFLENVPSIPVE

DDESQNimLLPSDEGIiRIDRMTQKTPFECSQSEVMFLSLDKKPESKRNEEAaS^ 

ESWFPKYIVPSQDLRPYKVNIDPSSEAPGHSCRSRPVESSCSSKFSWDEYEQYKKE 

SEQ ID NO: 196_AA840598_M IRAKM_M 

MWKRFLSELEVLLLFRHPHILELAAYFTETEKLCLVYPYMSNGTLFDRLQCTNGTTPLSW 

HWISVLIGIAKAIQYLHNTQPCAVICGNVSSANILLDDQLQPKLTDFAAAHF 

SSTINMTGGGRKHLWYMPEEYIRQGRLSVKTDVYSFGIVIMEVLTGCKVVLD^ 

DLLMELMEKRGLDSCLSFLDRKIPPCPRNFSAKLFSIAGRCVATK^ 

STQPSLYFAEDPPTSLKSFRCPSPLFLDNVPSIPVEDDENQNNHSVPPKEVLGTDRVTQK 

TPFECSQSEVTFLGIiDRNRGNRGSEADCNVPSSSHEECWSPELVAPSQDLSPTVISIiGSS 

WEVPGHSYGSKPMEKRCSSGLFCSEHEQSKKQ

SEQ ID NO: 197_AA088547_H

MASAVRGSRPWPRLGLQLQFAALLLGTLSPQVHTLRPENLLLVSTLDGSLHALSKQTGDL
KWTLRDDPVIEGPMYVTEI^FLSDPADGSLYILGTQKQQGLMKLPFTIPELVH;^
DGVF^TGRKQDAWFVVDPESGETQMTLTTEGPSTPRLYIGRTQYTVTNWDPRAPALRWNT
TYRRYSAPPMDGSPGKYMSHUVSCGMGLLLTVDPGSGTVLWTQDLGVPVMGV^

RQLPHLTLARDTLHFIiALRWGHIRLPASGPRDTATLFSTLDTQLLMTLYVGKDETGFYVS 

KALVHTGVALVPRGLTLAPADGPTTDEVTLQVSGEREGSPSTAVRYPSGSVALPSQWLLI 

GHHELPPVLHTTMLRVHPTLGSGTAETRPPENTQAPAFFLEIjLSLSREKLWDSELHPEEK 

TPDSYLGLGPQDIiliAASLTAVLLGGWILFVMRQVVEKQQETPLAPADFAHISQDAQSLHS 

GASRRSQKRLQSPSKQAQPLDDPEAEQLTWGKISFNPKDVLGRGAGGTFVFRGQFEGRA 

VAVKRLLRECFGLVRREVQLLQESDRHPNVLRYFCTERGPQFHYIALELCRASLQEYVEN 

PDLDRGGLEPEVVLQQLMSGLAHLHSLHIVHRDLKPGNILITGPDSQGLGRVVLSDFGLC 

KKLPAGRCSFSLHSGIPGTEGWMAPELLQLLPPDSPTSAVDIFSAGCVFYYVLSGGSHPF

GDSLYRQANILTGAPCIAHLEEEVHDKWARDLVGAMLSPLPQPRPSAPQVIAHPFFWSR 

AKQLQFFQDVSDWLEKESEQEPLVRALEAGGCAVVRDNWHEHISMPLQTDLRKFRSYKGT 

SVRDLLRAVRNKKHHYRELPVEWQALGQVPDGFVQYFTNRFPRLLLHT 

LFLPYYPPDSEARRPCPGATGR 

SEQ ID NO: 198_HGP_6644466 

MEGISNFKTPSKLSEKKKSVLCSTPTINIPASPFMQKLGFGTGVNVYLMKRSPRGLSHSP 

WAVKKINPICNDHYRSWQKRIi^ClEAKILKSLHHPNIVGyRAFTEANDGSLCI^ 

KSLNDLIEERYKASQDPFPAAIILKVALNMARGLKYLHQEK^ 

TIKICDVGVSLPLDENMTVTDPEACYIGTEPWKPKEAVEENGVITDKADIFAFGLTLWEM
MTLSIPHINLSNDDDDEDKTFDESDFDDEAYYAALGTRPPINMEELDESYQKVIELFSVC 
TNEDPKDRPSAAHIVEALETDV 

SEQ ID NO: 199_AA449542_M 

SPRGLSHSPWAVKKISLLCDDHYRTVYQKRLTDEAKILKNLNHPNIIGYRAFTEASDGSL 
CLAMEYGGEKSLNDLIEERNIODSGSPFPAAVILRVALHMARGLKYLHQEK^ 
FIGURE 1T

NWIKGDFETIKICDVGVSLPLDENMTVTDPEACYIGTEPWKPKEALEENGIITDKADV^ 

AFGLTLWE^IMTLCIPHVNLPDDDVDEDATFDESDFDDEAYYAALGTRPSINMELDD^

AIELFCVGTNEDPKDRPSAAHIVEALELDGQCCGLSESKH 

SEQ ID NO: 200_5R57_10_2_M TESK2_M 
LLDSDLYLPWTVRVKLAYGIAVGLSYLHFKGIFHRDLTSKV. 

SEQ ID NO: 201_AA232253_H 

MSSLGASFVQIKFDDLQFFENCGGGSFGSVYR/^ISQDKEVAVKKLLKIEKEAEILSVL 
SHRNIIQFYGVILEPPNYGIWEYASLGSLYDYINSNRSEEMDMDHIMTWATDVAKGMHY 
LHMEAPVKVIHRDLKSRNWIAADGVLKICDPGASRFHNHTT™

LPVSETCDTYSYGVVLWEMLTREVPFKGLEGLQVAWLVVEKNERLTIPSSCPRSFAELLH 

QCWEADAKKRPSFKQIISILESMSNDTSLPDKCNSFLHNKAEWRCEIEATLERLKKLERD 

LSFKEQELKERERRLKMWEQKLTEQSNTPLLPSFEIGAWTEDDVYCWVQQLVRKGDSSAE 

MSVYASLFKENNITGKRLLLLEEEDLKDMGIVSKGHIIHFKSAIEKLTHDYINLFHFPPL 

IKDSGGEPEENEEKIVNLELVFGFffljKPGTGPQDCKWKiyrraEMDGDEIAITYira 

NLPDAEILKIvrrKPPFVMEKWIVGIAKSQTVECIVTYESDVRTPKST^^ 

DEVKAVQIAIQTLFTNSDGNPGSRSDSSADCQWLDTLRMRQIASNTSLQRSQSNPILGSP 

FFSHFDGQDSYAAAVRRPQVPIKYQQITPVNQSRSSSPTQYGIITKNFSSLHLNSRDSGFS
SGNTDTSSERGRYSDRSRNKYGRGSISLNSSPRGRYSGKSQHSTPSRGRYPGKFYRVSQS
ALNPHQSPDFKRSPRDLHQPNTIPGMPLHPETDSRASEEDSKVSEGGWTKVEYRKKPHRP

SPAKTNKERARGDHRGWRNF 

SEQ ID NO: 202_AI375137_H 

MGNYKSRPTQTCTDEWKKKVSESYVITIERLEDDI^IKEKELTELRNIFGSDEAFSKVNL 
NYRTENGLSLLHLCCICGGKKSHIRTLMLKGLRPSRLTRNGFTALHIAVYra
LHSGADIQQVGYGGLTALHIATIAGHLEAADVLLQHGANVNIQDAVFFTPLHIAAYYGHE 
QVTRLLLKFGADVNVSGEVGDRPLHLASAKGFLNIAKLLMEEGSKADVN^^ 
FCSRFGHHDIVKYLLQSDLEVQPHVVNIYGDTPLHLACYNGKFEVA^IQISGTESLTK

ENIFSETAFHSACTYGKSIDLVKFUJDQNVININHQGRDGHTGIIHSACYHGHIRLVQFL^
DNGADMNLVACDPSRSSGEKDEQTCLMWAYEKGHDAIVTLLKHYKRPQDELPCNEYSQPG
GDGSYVSVPSPLGKIKSMTKEKADIILILIRAGLPSHFHLQLSEIEFHEIIGSGSFGKVYK^

RCRNKIVAIKRYRANTYCSKSDVDMFCREVSILCQLNHPCVIQFVGACLNDPSQFAI

YISGGSLFSLLHEQKRILDLQSKLIIAVDVAKGMEYLHNLTQPIIHRDLNSHNILLYEDG

HAWADFGESRFLQSLDEDNMTKQPGNLRWMAPEVFTQCTRYTIKADVFSYALCLWEILT 

GEIPFAHLKPAAAAADMAYHHIRPPIGYSIPKPISSLLIRGWNACPEGRPEFSEWMKLE 

ECLOsriELMSPASSNSSGSLSPSSSSDCLVNRGGPGRSHVAALRSRFELEYAmARSYAA 

LSQSAGQYSSQGLSLEEMKRSLQYTPIDKYGYVSDPMSSMHFHSCRNSSSFEDSS 

SEQ ID NO: 203_H97685_H 

MESERSPLYRQLIDLGYLSSSHWNCGAPGQDTKAQSMLVEQSEKLRHLSTFSHQVLQTRL 

VDAAKALNLVHCHCLDIFINQAFDMQRDLQITPKRLEYTRKKENELYESI^ 

MKDMIVETLNTMKEELLDDATNMEFKDVIVPENGEPVGTREIKCCIR 

VANKLISSVDYLRESFVGTLERCLQSLEKSQDVSVHITSNYLKQILNAAYHVEVTFHSGS 

SVTRMLWEQIKQIIQRITVn^SPPAITLEWKRK^AQEAIESLSASKLAKSICSQFRTRI^

SHEAFAASLRQLEAGHSGRLEKTEDLWLRVRKDHAPRLARLSLESRSLQDVLLHRKPKLG 

QELGRGQYGVVTLCDNWGGHFPCALKSWPPDEKHWNDLALEFHYMRSLPKHERLVDLHG 

SVIDYNYGGGSSIAVLLIMERLHRDLYTGLKAGLTLETRLQIALDVVEGIRFLHSQGLVH 

RDIKLKNVLLDKQNRAKITDLGFCKPEAI^SGSIVGTPIHMAPELFTGKYDNSVD 
FIGURE 1U



ILFWYICSGSVKLPEAFERCASKDHLVnsnSFVRRGARPERLPWDEECWQI^^ 
RPLLGIVQPMLQGIMNRLCKSNSEQPNRGLDDST 

SEQ ID NO: 204_W20810_M 

DWLKASKASDVYSFGILWAVLiAGREAELVDKTSLIRETVCDRQSRPPLTELPPGSPET 

PGLEKLKELMIHCWGSQSEI^PSFQDCEPKTNEVYNLVKDKVDAAV^ 

RNLSAREPSQRGTEmCPRETlWSKMLDRLHLEEPSGPVPGKCPERQAQDTSVGPATPAR 

TSSDPVAGTPQIPHTLPFRGTTPGPVFTETPGPHPQRNQGDGRHGTPWYPWTPPNPMTGP 

PALVFNNCSEVQIGNYNSLVAPPRTTASSSAKYDQAQFGRGRGWQPFHK 

SEQ ID NO: 205_AA744236_H 

MGSENSALKSYTLREPPFTLPSGIAVYPAVLQDGKFASVF\nfKRENEDKVNKAAK^ 

RHPCLLRFLSCTVEADGIHLWERVQPLEVALETLSSAEVCAGIYDILLALIFLHDRGHL 

THNNVCLSSVFVSEDGHWKLGGl^TVCKVSQATPEFLRSIQSIRDPASIPPEEMSPEFTT 

LPECHGHARDAFSFGTLVESLLTILNEQVSADVLSSFQQTUISTLLNPIPKCRPALCTLiL 

SHDFFRNDFLEWNFLKSLTLKSEEEKTEFFKFLLDRVSCLSEELIASRLVPLLLNQLVF 

AEPVAVKSFLPYLLGPKJCDHAQGETPCLLSPALFQSRVIPVLLQLFEVHEEHVRMVLLSH 

lEAYVEHFTQEQLKKVILPQVLLGLRDTSDSIVAITLHSLAVLVSLLGPEVVVGGERTKI 

FKRTAPSFTKNTDLSLEGDPFSQPIKFPINGLSDVKNTSEDSENFPSSSKKSEEWPDWSE 

PEEPENQTVNIQIWPREPCDDVKSQCTTLDVEESSWDDCEPSSLDTKVNPGGGITATKPV 

TSGEQKPIPALLSLTEESMPWKSSLPQKISLVQRGDDADQIEPPKVSSQERPLKVPSELG 

LGEEFTIQVKKKPVKDPEMDWFADMIPEIKPSAAFLILPELRTEMVPKKDDVSPVMQFSS 

KFAAAEITEGEAEGWEEEGELNWEDNNW 

SEQ ID NO: 206_AI052250_H 

MESMLNKLKSTVTKVTADVTSAVMGIPVTREFDVGRHIASGCNGLAWKIFNGTKKSTKQE
VAVFVFDKKLIDKYQKFEKDQIIDSLKRGVQQLTRLRHPRLLTVQHPLEESRDCIAFCTE
PVFASLANVLGNWENLPSPISPDIKDYKLYDVETKYGLLQVSEGLSFLHSSVKMVHGNIT
PENIILNKSGAWKIMGFDFCVSSTNPSEQEPKFPCKEWDPNLPSLCLPNPEYIAPEYILS
VSCETASDMYSLGTVMYAVFNKGKPIFEVNKQDIYKSFSRQLDQLSRLGSSSLTNIPEEV
REHVKLLLNVTPTVRPDADQMTKIPFFDDVGAVTLQYFDTLFQRDNLQKSQFFKGLPi^
PKLPKRVIVQRILPCLTSEFVNPDMVTFVLPNVLLIAEECTKEEYVKLILPELGPVFKQQ
EPIQILLIFLQKMDLLLTKTPPDEIKNSVLPMVYRALEAPSIQIQELCLNIIPTFANLID
YPSMKNALIPRIKNACYKHLPLRFV 

SEQ ID NO: 207_AA278842_H 

>IWFFARDPVRDFPFELI PE PPEGGLPGPWALHRGRKKATGS PVS I FVYDVKPGAEEQTQV 

AKAAFKRFKTLRHPNILAYIDGLETEKCLHVVTEAVTPLGIYLKARVEAGG^^ 

LHQIVKALSFLVNDCSLIHNNVCMAAVFVDRAGEWKLGGLDYMYSAQGNGC^ 

LEQYDPPELADSSGRWREKWSADMWRLGCLIWEVFNGPLPRAAALRNPGKIPKTLVPHY 

CELVGANPKVRPNPARFLQNCRAPGGFMSNRFVETNLFLEEIQIKEPAEKQKFFQELSKS 

LDAFPEDFCRHKVLPQLLTAFEFGNAGAWLTPLFKVGKFLSAEEYQQKIIPWVKMFSS

TDRAMRIRLLQQMEQFIQYLDEPTVNTQIFPHVVHGFLDTNPAIREQT^ 

EANLNVELMKHFARLQAKDEQGPIRCNTTVCLGKIGSYLSASTRHR^ 

APSRVAGVLGFAATHNLYSMNDCAQKILPVLCGLTVDPEKSVRDQAFKAIRSFLSKLESV 

SEDPTQLEEVEKDVHAASSPGMGGAAASWAGWAVTGVSSLTSKLIRSHPTTAPTETNIPQ 

RPTPEGVPAPAPTPVPATPTTSGHWETQEEDKDTAEDSSTADRWDDEDWGSLEQEAESVL 

AQQDDWSTGGQVSRASQVSNSDHKSSKSPESDWSSWEAEGSWEQGWQEPSSQEPPPDGTR 

LASEYNWGGPESSDKGDPFATLSARPSTQPRPDSWGEDNWEGLETDSRQVKAELARKKR^ 

ERRREMEAKRAERKVAKGPMKLGARKLD 
FIGURE 1V



SEQ ID NO: 208_AA599286_H 

MAFMEKPPAGKVLLDDTVPLTAAIEASQSLQSHTEYIIRVQGGISVENSWQIVRRYSDFD

LLNNSLQIAGLSLPLPPKXLIG^IMDREFIAERQKGLQNyLWIT^ 

PNNYSANYTEIALQQVSMFraSEPKWEVVEPLKDIGWRIRKKYFIiMKIKI^QPKE^ 

ADLGPDKYLSDKDFQCLIKLLPSCLHPYIYRVTFATANESSALLIRMFNEKGTLKDLIYK 

AKPKDPFLKKYCNPKKIQGLELQQIKTYGRQILEVLKFLHDKGFPYGHLHASNVM^ 

CRLLDLENSLLGLPSFYRSYFSQFRKINTLESVDVHCFGHLLYEMTYGRPPDSVPVDSFP 

PAPSMAWAVLESTLSCEACKNGMPTISRLLQMPLFSDVLLTTSEKPQFKIPTKLKEALR

lAKECIEKRLIEEQKQIHQHRRLTRAQSHHGSEEERKKRKILARKKSKRSALENSEEHSA 

KYSNSNNSAGSGASSPLTSPSSPTPPSTSGISALPPPPPPPPPPAAPLPPASTEAPAQLS 

SQAVNGMSRGALLSSIQNFQKGTLRKAKPVITVLRRSAEASCLHLEGKVLFYSYSPLPPN 

YPLPGKVIAEPVQPQTVLFCRCSCKQLFERNNSLSRIKLGWHAKKKKKK 

SEQ ID NO: 209_AA425725_H

MSASTGGGGDSGGSGGSSSSSQASCGPESSGSELALATPVPQMLQGLLGSDDEEQEDPKD 

YCKGGYHPVKIGDVTNGRYHVWKLGWGHFSTVWLCWDIQRKRFVALKVVKSAGOT 

VDEIKLLKCVRDSDPSDPKRETIVQLIDDFRISGVNGVHVCMVLEVLGHQLLI^ 

QGLPVPCVKSIVRQVIiHGLDYLHTKCKIIHTDIKPENILLCVGDAYIRRI^^ 

GAPPPSRSIVSTAPQEVLTGKLSKNKRKKMRRKRKCXJKRLLEERLRDLQRLEA 

EDSGIiRLDGGSGSTSSSGFSGSLFSPASCSILSGSSNQRETGGLLSPSTPFGASNLLVNP 

LEPQNADKIKTKIADIiGNACWVHKHFTEDIQTRQYRAVEVLIGAEYGPPADIWST^ 

EIiATGDYLFEPHSGEDYSRDEDHIAHIVELLGDIPPAFALSGRYSREFFNRRGELRHIHN 

LKHWGLYEVLMEKYEWPLEQATQFSAFLLPMMEYIPEKRASAADCLQHPWLNP 

SEQ ID NO: 210_SGK022_H 

MEDFLLSNGYQLGKTIGEGTYSKVKEAFSKKHQRKVAIKVIDKMGGPSEFIQRFLPRELQ 
IVRTLDHKNIIQVYEMLESADGKICLVMELAEGGDVFDCVLNGGPLPESRAK^ 
AIRYCHGCGVAHRDLKCENALLQGFNLKLTDFGFAKVLPKSHRELSQTFCGSTAYAAPEV 
LQGIPHDSKKGDVWSMGVVLYVMLCASLPFDDTDIPKMLWQQQKGVSFPTHLSISADCQD 
LLKRLLEPDMILRPSIEEVSWHPWLAST

SEQ ID NO: 211_AA060026_M SGK022_M 

MEDFLLSNGYQLGKTIGEGTYSKVKEAFSKKHQRKVAIKIIDKMGGPEEFIQRFLPRELQ 
IWTLDHKNIIQVYEMLESADGKIYLVMELAEGGDVFDCVIiNGGPLPESRAK;^ 
AIRYCHGCGVAHRDLKCENALLQGFNLKLTDFGFAKVLPKSRRELSQTFCGSTAYAAPEV 
LQGIPHDSKKGDVWSMGVVXYVI^CASLPFDDTDIPKMLWQQQKGVSFPTHLGISTECQ^ 
LLKRLLEPDMILRPSIEEVSWHPWLAST

SEQ ID NO: 212_AA399669_H

MGKGDVLEAAPTTTAYHSLMDEYGYEVGKAIGHGSYGSVYEAFYTKQKVMVAVKIISKKK

ASDDYLNKFLPREIQQVMKVLRHKYLINFYRAIESTSRVYIILELAQGGDVLEWIQRYGA 

CSEPLAGKWFSQLTLGIAYLHSKSIVHRDLKLENLLLDKWENVKISDFGFAK^^ 

GCSPXYRQVNCFSHLSQTYCGSFAYACPEILRGLPYNPFLSDTWSMGVILYTLWAHLPF 

DDTNLKKLLRETQKEVTFPANHTISQECKVQLLIACVAQWRKTQARPLSPLL 

SEQ ID NO: 213_AA758539_H 

MDDATVLRKKGYIVGINLGKGSYAKVKSAYSERLKFNVAVKIIDRRKTPTDFVERFLPRE 
MDILATVNHGSIIKTYEIFETSDGRIYIIMELGVQGDLLEFIKCQGALHEDVARKMFRQL 
SSAVKYCHDLDIVHRDLKCENLLLDKIDFNIKLSDFGFSKRCLRDSNGRIILSKTFCGSAA 




WO 00/73469



PCT/US00/14842



FIGURE 1W



YAAPEVLQSIPYQPKVYDIWSLGVILYIMVCGSMPYDDSDIRKMLRIQKEHRVDFPRSKN 
LTCECKDLIYRMLQPDVSQRLHIDEILSHSWLQPPKPKATSSASFKREGEGKYRAECKLD 
TKTGLRPDHRPDHKLGAKTQHRLLWPENENRMEDRLAETSRAKDHHISGAEVGKAST

SEQ ID NO: 214_AA883975_H 

MSGDKLLSELGYKLGRTIGEGSYSKVKVATSKKYKGTVAIKVVDRRRAPPDFVNK^ 

LSILRGVRHPHIVHVFEFIEVCNGKLYIVMEAAATDLLQAVQRNGRIPGVQARDLFAQIA 

GAWYLHDHHLVHRDLKCENVLLSPDERRVKLTDFGFGRQAHGYPDLSTTYCGSAAYASP 

EVLLGIPYDPKKYDWSMGVVLYVMVTGCMPFDDSDIAGLPRRQKRGVLYPEGLELSERC 

KALIAELLQFSPSARPSAGQVARNCWLRAGDSG 

SEQ ID NO: 215_AA905446_H 

VGRQETGVRRWAFLICQPISPPLTSSEFIQRFLPRELQIVRTLDHKNIIQVYEMLESADG 
KICLVMELJ^GGDVFDCT^n^GGPLPESRAKALFRQMVEAIRYCH^ 

QGFNLKLTDFGFAKVLPKSHRELSQTFCGSTAYAAPEVLQGIPXKMLWQQQKGVSFPTHL 
SISADCQDLLKRLLEPDMILRPSIEEVSWHPWLAST

SEQ ID NO: 216_H29974_H 

YSLLAEIGRGSYGVWEAVAGRSGARVAVKKIRCDAPENVEUUliAEFWALTS 
VQFEECVLQRNGIiAQRMSHGNKSSQLYLRLVETSLKGERILGYAEEPCYLWFVMEFCEGG 
DLNQYVLSRRPDPATNKSFMLQLTSAIAFLHKNHIVHRDLKPDNILITERSGTPILKV^ 
FGLSKVCAGLAPRGKEGNQDNKNVNVNKYWLS SACGSDFYMAPEVWEGHYTAKADI FALG 
IIIWAMIERITFIDSETKKELLGTYIKQGTEIVPVGEALLENPKMELHIPQKRRTSMSEG 
IKQLLKDMLAANPQDRPDAFELETRMDQVTCAA

SEQ ID NO: 217_AA498104_M H29974_M

PLLLPPPPAAl^TGKENGARRGTKSPERKRRSPVQRVLCEKLRPAAQAMDPAGAEVPGEA 

FLARRRPDGGGGDVPARPRYSLLAEIGRGSYGVVYEAVAGRSGARVAVKKIRCDAPENVE 

LALAEFWALTSLKRRHQNIVQFEECVLQRNGLiAQRMSHGNKNSQLYLRLVET 

GYAEEPCYLWFVMEYCEGGDLNQYVLSRRPDPATNKSFMLQLTSAIAFLHKNHIVHRDLK 

PDNILITERSGTPILKVADFGLSKVCAGLAPRGKEGNQDNKNVNVNKYWLSSACGSDFYM 

APEVWEGHYTAKADIFALGIIIWAMIERITFIDSETKKELLGTYIKQGTEIVPVGEALLE 

NPKMELHIPQKRRTSMSEGVKQLLKIDMLAANPQDRPDAFELETRMDQVTCAA 

SEQ ID NO: 218_AA215311_H 

MVSSQPKYDLIREVGRGSYGVVYEAVIRKTSARVAVKKIRCHAPENVELALREFWA^ 

KSQHPNVIHLEECILQKDGMVQKMSHGSNSSLYIiQLVETSLKGEIAFDPRSAYYLWFVMD 

FCDGGDMNEYLLSRKPNRKTNTSFMLQLSSALAFLHKNQIIHRDLKPDNILI 

DLEPTLKVADFGLSKVCSASGQNPEEPVSVNKCFLSTACGTDFYMAPEVWEGHYTAKADI 

FALGIIIWAMLERITFIDTETKKELLGSYVKQGTEIVPVGET^LENPKMELLIPVKXKSM 

NGRMKQLIKEMLAANPQDRPDAFELELRLVQIAFKDSSWET 

SEQ ID NO: 219_AA018361_H 

MRAAFPAGGAGGSVEPPSARPAPQPAGTAARSEEAPARAQAAGMAGPGWGPPRLDGFILT 

ERLGSGTYATVYKAYAKKDTREWAIKCVAKKSIiNKASVENLLTEIEIL^^ 

KDFQWDSDNIYLIMEFCJ^GGDLSRFIHTRRILPEKVARVFMQQIiASALQFLHERNISHLD 

LKPQNILLSSLEKPHLKLADFGFAQHMSPWDEKHVLRGSPLYMAPEMVCQRQYDARVDLW

SMGVILYEALFGQPPFASRSFSELEEKIRSNRVIELPLRPLLSRDCRDLLQRLLERDPSR 

RISFQDFFAHPWVDLEHMPSGESLGRATALWQAVKKDQEGDSAAALSLYCKALDFFVPA 
FIGURE 1X

LHYEVDAQRKEAIK7U<VGQYVSRAEELKAIVSSSNQALLRQGTSARDLI^^ 
AALEVASAAMAKEEAAGGEQDALDLYQHSLGELLLLLRSPRAGGGSCFTLRFRTSWPELN 

T 



SEQ ID NO: 220_AA311714_H

MENFILYEEIGRGSKTVVYKGRRKGTINFVAILCTDKCRRPEITNVMILTO^

FHEWYETSNHLWLVXENLPEDWREFGIDLISGLHHLHKLGILFCDISPRKILLEGPGTL

KFSNFCLAKVEGENLEEFFALVAAEEGGGDNGENVLKKSMKSRVKGSPVYTAPEVVRG;^

FSISSDLWSLGCLLYEMFSGKPPFFSESVSELTEKILCEDPLPPIPKDSSRPKASSDFIN

LLIXSLLQRDPQKRLTWTRLLQHSFWKKAFAGADQESSVEDLSLSROT

QNSQSRQAKGHKSGQPLGHSFRLENPTEFRPKSTLEGQLNESMFLLSSRPTPRTSTAVEV

SPGEDMTHCSPQKTSPLTKITSGHLSQQDLESQMRELIYTDSDLVVTPIIDNPKIMKQPP

VKFDAKIIJILPTYSVDKLLFLKDQDWNDFLQQVCSQIDSTEKSMGASRAK^

AGHQEVATRLLHSPLFQLLIQHLRIAPNWDIRAKVAHVIGLLASHTTELQENTPWETTS

SIGIGILNCLVQHSTPVPRQCLVYV

SEQ ID NO: 221_SGK384_H

SLAHVLRARQILTEPEVRDYLRGLVSGLRYLHQRCILHR

SEQ ID NO: 222_AA210451_M SGK384_M 

MGQQHGTRNGLTHRELPRGVGUjLAMALMNVALYLCLDQLFISPGRSTADSRRCPPGYFR 

MGRMRNCSRWLSCEELRTEVRQLKRVGEGAVKRVFLSEWKEHKVALSRLTRLEMKEDFLH 

GLQMLKSLQSEHVVTLVGYCEEDGTILTEYHPLGSLSNLEETLNLSKYQDVNTWQHRLQL 

AMEYVSIINYLHHSPLGTRVMCTSNDLPKTLSQYLLTSNFSIVANDU3^ 

IKCGHRELHGDFVAPEQLWPYGEDTPFQDDLMPSYNEKVDIWKIPDVSSFLLGHVEGSDM 

VRFHLFDIHKACKSQIPAERPTAQNVLDAYQRVFHSLRDTVMSQTKEML 

SEQ ID NO: 223_SGK071_2_H 

EWAVQMMVEC^©DHYASQALEELMPLLKLRHAHISWQELFITWNGEISS
NELSFQEVIEDKRKAKKIIDSEWMQNVLGQVLDALEYLHHLDIIHRNLKI>SNIILISS
CKLQDLSSNVLMTDKAKWNIRAEEDPFRKSW^4APEALNF^ 

SFMDGTEAMHLRKSLRQSPGSLKAVLKTMEEKQIPDVETFRNLLPLMLQIDPSDRITIKD
WHITFLRGSFKSSCVSLTIiHRQMVPASITDMLLEGNVASILGDAGDTKGERALKLLSMA 
LASYCLVPEGSLFMPLALLHMHDQWLSCDQDRVPGKRDFASLGKLGKLLGPI^ 

ELVEVVVTTMELHDRVLDVQLGACSLLLHLLGQ;^

PEEEPLLVMVYSLLAITTTQESESLSEELQNAGLLEHILEHLNSSLESRDVCASGLGLLW 
ALLLDDPILALQRPRKKRAPNHGKPGKPKNPASTQSIIWKAPLEKVPDLISQV^ 
DGEMAEASCGVFWLLSLLGCIKEQQFEQVVALLLQSIRLCQDRALLVNNAYRGLASLVKV 
SELAAFKVWQEEGGSGLSLIKETYQLHRDDPEVVENVGMLLVHLASYEILPELVSSSM

KALLQEIKERFTSSLVSDSSAFSKPGLPPGGSPQLGCTTSGGLE

SEQ ID NO: 224_AA118352_M SGK071_M

EEDPCQKSWMAPEALKFSFSTKSDIWSLGCIILDMATCSFLNDTEAMQLRKAIRHHPGSL
KPILKTMEEKQIPGTDVYYLLLPFMLHINPSDRLAIKDVMQVTFMSNSFKSSSVALNMQR
QKVPIFITDVLLEGNMANILGSWLCASFVNDSRHCDSGIGSQRLGFDFQSVSWTEHPLKD
VMQNFSSRPEVQLRAINKLLTMPEDQLGLPWPTELLEEVISIIKQHGRILDILLSTCSLL
LRVLGQALAKDPEAEIPRSSLIISFLMDTLRSHPNSERLVNVVYNVIAIISS^^^
EEEGLFQLAQENLEHFQEDRDICLSILSLLWSLLVDVVTVDKEPLEQLSGMVTWLA
EDVEIAEAGCAVLWLLSLLGCIKESQFEQVWLLLRSIQLCPGRVLLVNNAFRGLASLAK

^W I/) 
VSELVAFRIVVLEEGSSGLHLIQDIYKLYKDDPEVVENLCMLLAHLTSYKEILPE^^
IKDLVQVIRGRFTSSLELISYADEILQVLEANAQPGLQEDQLEPPAGQEAPLQGEPLFRP 

SEQ ID NO: 225_018653.9_H 

GRGRGAGHARGLGRGPAGRRAEPPRSLSRPGPGPGSRAGPAGRGEGSDAAPAGGSGRGFL 
RLLPAGLRPQRALRSGSEPPRPGQSPEPSPAPGAGRRGGRGELARQIRARYEEVQRYSRG 
GPGPGAGRPERRRLMDLAPGGPGLPRPRPPWARPLSDGAPGWPPAPGPGSPGPGPRLGCA 
ALRNVSGAQYMGSGYTKAVYRWLPGGAAVALKAVDFSGHDLGSCVREFGVRRGCYRIiAA 
HKLLKEMVLLERLRHPNVLQLYGYCYQDSEDIPDTLTTITELGAPVEMIQLLQTSWEDRF 
RICLSLGRLLHHLAHSPLGSVTLLDFRPRQFVLVDGELKVTDLDDARVEETPCAGSTDCI
LEFPTU^FTLPCSAQGWCEGMNEKRNLYNAYRFFFTYLLPHSAPPSLRPLLDSIVNATGE
LAWGVDETLAQLEKVLHLYRSGQYLQNSTASSSTEYQCIPDSTIPQEDYRCWPSYHHGSC
LLSVFNIiAEAVDVCESHAQCRAFVVTNQTTWTGRQLVFFKTGWSQWPDPNKTTY^ 

SEQ ID NO: 226_AA396601_M 

TRPGCAALRNVSGAQYVGSGYTKAVYRVRLPGGAAVALKAVDFSGHDLGSCVREFGARRG 
CYRLAAHKLLKEMVLLERLRHPNVLQLYGYCYQDSEGIPDTLTTITELGAPVEMIQLLQT 
SWEDRFRICLSLGRLLHHLAHSPLGSVTLLDFRPRQFVLWG 

SSADCTLEFPARNFSLPCSAQGWCEGMNEKRNLYNAYRFFFTYLLPHSAPPSLRPLLDSI 

VNATGELAWGVDETLAQLETALHLFRSGQYLQNSTSSRAEYQRIPDSAITQEDYRCWPSY 

HHGGCLLSVFNIiAEAIDVCESHAQCRAFVVTNQTTWTGRKLVFFKTGWN^ 

VKAPG 

SEQ ID NO: 227_VRK3_H 

MISFCPDCGKSIQAAFKFCPYCGNSLPVEEHVGSQTFVNPHVSSFQGSKRGLNSSFETSP

KKVKWSSTVTSPRLSLFSDGDSSESEDTLSSSERSKGSGSRPPTPKSSPQKTRKSPQVTR 

GSPQKTSCSPQKTRQSPQTLKRSRVTTSLEALPTGTVLTDKSGRQWKLKSFQTRDNQGIL 

YEAAPTSTLTCDSGPQKQKFSLKLDAKDGRLFNEQNFFQRAAKPLQVNKWKKLYSTPLLA 

IPTCMGFGVHQDKYRFLVLPSLGRSLQSALDVSPKHVLSERSVLQVACRLLDALEFLHEN 

EYVHGNVTAENIFVDPEDQSQVTIAGYGFAFRYCPSGKHVAYVEGSRSPHEGDLEFISMD 

LHKGCGPSRRSDLQSLGYOVILKWLYGFLPWTNCLPNTEDIMKQKQKFVDKPGPFVGPCG 

WIRPSETLQKYLKVVMALTYEEKPPYAMLRNNLEALLQDIiRVSPYDPIGLPMVP 

SEQ ID NO: 228_S71575_M VRK3_M

IPTCIGFGIHQDKYRFLVFPSLGRSLQSALDDNPKHVVSERCVLQVACRLLDALEYLHEN 
EYVHGNLTAENVFVNPEDLSQVTLVGYGFTYRYCPGGKHVAYKEGSRSPHDGDLEFISMD 
IJIKGCGPSRRSDLQTLGYCMLKWLYGSLPWTNCLPNTEKITRQKQKYLDSPERLVGLCGR 
VmKASETLREYLKVVMALNYEEKPPYATLRNSLEALLQDMRVSPYDPLDLQM^ 

SEQ ID NO: 229_AA45427_H

MGHALCVCSRGTVIIDNKRYLFIQKLGEGGFSYVDLVEGLHDGHFYALKRILCHEQQDRE

EAQREADMHRLFNHPNILRLVAYCLRERGAKHEAWLLLPFFKRGTLWNEIERLKDKGNFL

TEDQILWLLLGICRGLEAIHAKGYAHRDLKPTNILLGDEGQPVIJVIDLiGSMNQAC 

RQALTLQDWAAQRCTISYRAPELFSVQSHCVIDERTDVWSLGCVLYAMMFGEGPYDMVFQ 

KGDSVALAVQNQLSIPQSPRHSSALRQLLNSMMTVDPHQRPHIPLLLSQLEALQPPAPGQ 

HTTQI 

SEQ ID NO: 230_H05721_H 

MAVRQALGRGLQLGRALLLRFTGKPGRAYGLGRPGPAAGCVRGERPGWAAGPGAEPRRVG 
LGLPNRLRFFRQSVAGLAARLQRQFWRAWGCAGPCGRAVFLAFGLGLGLIEEKQAESRR 
AVSACQEIQAIFTQKSKPGPDPLDTRRLQGFRLEEYLIGQSIGKGCSAAVYEATMPTLPQ

NLEVTKSTGLLPGRGPGTSAPGEGQERAPGAPAFPIJ^IKNIiyn^ISAGSSSEAII^^ 

LVPASRVAIiAGEYGAVTYRKSKRGPKQLAPHPNIIRVLRAFTSSVPLLPGALVDYPDVLP 

SRLHPEGLGHGRTLFLVMKNYPCTLRQYLCVNTPSPRIiAAMMLLQLLEGVDHLVQQGIAH 

RDLKSDNILVEUDPDGCPV^VIADFGCCLADESIGLQLPFSSWYVDRGGNGCLMAPEVST 

ARPGPRAVIDYSKADAWAVGAIAYEIFGLVNPFYGQGKAHLESRSYQEAQLPALPESVPP 

DWQLVRALLQREASKRPSARVAAimiHLSLWGEHILALKNLKLDKM^ 

ANRLTEKCCVETKMKMLFLANLECETLCQAALLLCSWRAAL 

SEQ ID NO: 231_AI086865_H 

MEKYERIRWGRGAFGIVHLCLRKADQKLVIIKQIPVEQMTKEERQAAQNECQVLKLLNH 

P WI EYYENFLEDKALMI AMEYAPGGTLAEF I QKRCNSl^EETILHFi^QI LI^ 

THLILHRDLKTQNILLDKHRMVVKIGDFGISKILSSKSTPCYISPELCEGKPYNQKSDIW 

ALGCVLYELASLKRAFEAANLPALVLKIMSGTFAPISDRYSPELRQLVLSLLSLEPAQRP 

PLSHIMAQPLCIRALLNLHTDGREVRGPQQHREQDHQCPLQRGIIMTFGSGSNGCLGHGS 

LTDISQPTIVEALLGYEWQQVEEALSFTLLGSAPLDQEPLLSIDLGTAHSAAVTGEEDL 

GSGDVNRLPSWERGHIiliAGVASSTDVSTFSEGDCKEPDKCCWRHKQCTGHIIYPFASDCV 

RHSLHLHSVNHCNCNSRLKDSSEDSSSSRGAGPTCSHVIESPCFELTPEEEHVERFRYGW 

CKSYRPVSVAVIHHPLYHECGADDLNXKKRKRRRRKSKPPIPTQVGPATASPDLGTSMAT 

GTPDSTAPITIWRSESPTGKGQGSKVIKK\nCKKKEKEKDKEEMDEKAKLKKKAI^ 

KKSPVKLEPSPPDVSRSLSARQLARMSESSPESREELESEDSYNGRGQGELSSEDIVESS 

SPRKRENTVQAKKTGAKPSQARKVNKRKSPPGSNPNLS

SEQ ID NO: 232_AA836348_H 

MSVLGEYERHCDSINSDFGSESGGCGDSSPGPSASQGPRAGGGAAEQEELHYIPIRVLGR 

GAFGEATLYRRTEDDSLVVWKEVDLTRLSEKERRDALNEIVILAIilOHDNI^ 

NTTLLIELEYCNGGNLYDKILRQKDKLFEEEMVVWYLFQIVSAVSCIHKAGILHRDIKTL 

NIFLTKANLIKLGDYGIiAKKLNSEYSMAETLVGTPYYMSPELCQGVKYNFKSDIWAV 

IFELLTLKRTFDATNPLNLCVKIVQGIRAMEVDSSQYSLELIQMVHSCLDQDPEQRPT^

ELLDRPLLRKRRRSSTVTEAPIAWTSRTSEVYVWGGGKSTPQKLDVIKSGCSARQVCAG

NTHFAVVTVEKELYTWVNMQGGTKLHGQLGHGDKASYRQPKHVEK^ 

TVCVTDEGQLYAFGSDYYGCMGVDKVAGPEVLEPMQLNFFLSNPVEQVSCGDNHVV^ 

NKEVYSWGCGEYGRLGIJDSEEDYYTPQKVDVPKALIIVAVQCGCDGTFLLTQSGKVLACG 

LNEFNKLGLNQCMSGIINHEAYHEVPYTTSFTLAKQLSFYKIRTIAPGKTHT 

LLTFGCNKCGQLGVGNYKKRLGINLLGGPLGGKQVIRVSCGDEFTIAATDEKVLNSKTIR 

SNS SGLS IGTVFQS S S PGGGGGGGGGEEEDSQQESETPDPSGGFRGTMEADRGMEGLI S P 

TEAMGNSNGASSSCPGWLRKELENAEFIPMPDSPSPLSAAFSESEKDTLPYEELQGLKVA 

SEAPLEHKPQVEASVTELFAFESQLVTSAESCSNLCWEGNTTDSSCVCVQLSAGGG 

SEQ ID NO: 233_R86668_H, MKK6_H 

MNLLLSYRDVQDYSAIIELVETLQALPTCDVAEQHNVCFHYTFALNRRNRPGDRAKALSV

LLPLVQLEGSVAPDLYCMCGRIYKDMFFSSGFQDAGHREQAYHWYRKAFDVEPSLHSGIN 

AAVLLIAAGQHFEDSKELRLIGMKLGCLLARKGCVEKMQYYWDVGFYLGAQILAl^ 

VLAAEQLYKIiNAPIWYIiVSVMETFLLYQHFRPTPEPPGGPPRRAHFWLHFLLQSCQPF 

ACAQGDQCLVLVLEmKVLLPAKLEVRGTDPVSTVTLSLLEPETQDIPSSWTFPVASICG 

VSASKRDERCCFLYALPPAQDVQLCFPSVGHCQWFCGLIQAWVTNPDSTAPAEEAEGAGE 

MLEFDYEYTETGERLVLGKGTYGWYAGRDRHTRVRIAIKEIPERDSRFSQPLHEEIALH 

RRLRHKNIVRYLGSASQGGYLKIFMEEVPGGSLSSLLRSVWGPLKDNESTISFYTRQILQ 

GLGYLHDNHIVHRDIKGDNVLINTFSGLLKISDFGTSKRLAGITPCTETFTGTLQYM^^

IIDQGPRGYGKAADIWSLGCTVIEMATGRPPFHELGSPQAAMFQVGMYKVHPPMPSSLSA 
EAQAFLLRTFEPDPRLRASAQTLLGDPFLQPGKRSRSPSSPRHAPRPSDAPSASPTPSAN

STTQSQTFPCPQAPSQHPPSPPKRCLSYGGTSQLRVPEEPAAEEPASPEESSGLSLLHQE 

SKRRAMLAAVLEQELPALAENLHQEQKQEQGARLGRNHVEELLRCLGAHIHT 

ELRALQGRLRAQGLGPALLHRPLFAFPDAVKQILRKRQIRPHWMFVLDSLLSRAVRAALG 

VLGPEVEKEAVSPRSEELSNEGDSQQSPGQQSPLPVEPEQGPAPLMVQLSLLRAETDRLR 

EILAGKEREYQALVQRALQRLNEEARTYVLAPEPPTALSTIXJGLVQWLQEL^^ 

LLiraSFTLHTLLTYATRDDLIYTRIRGGMVCRIWRAILAQRAGSTPVTSGP 

SEQ ID NO: 234_PAK6_H 

MFGKKKIOCIEISGPSNFEHRVHTGFDPQEQKFTGLPQQWHSLLADTANRPKPMVD

PIQIAPMKTIVRGNKPCKETSINGLLEDFDNISVTRSNSLRKESPPTPDQGASSHGPGHA 

EENGFITFSQYSSESDTTADYTTEKYREKSLYGDDLDPYYRGSHAAKQNGHVMKMKHGEA 

YYSEVKPLKSDFARFSADYHSHLDSLSKPSEYSDLKWEYQRASSSSPLDYSFQFTPSRTA 

GTSGCSKESLAYSESEWGPSLDDYDRRPKSSYLNQTSPQPTMRQRSRSGSGLQEPMMPFG 

ASAFKTHPQGHSYNSYTYPRLSEPTMCIPKVDYDRAQMVLSPPLSGSDTYPRGPAKLPQS 

QSKSGYSSSSHQYPSGYHKATLYHHPSLQSSSQYISTASYLSSLSLSSSTYPPPSWGSSS 

DQQPSRVSHEQFRAALQLWSPGDPREYLANFIKIGEGSTGIVCIATEKHTGKQVAVKKM 

DLRKQQRRELLFNEWIMRDYHHDJm/Dl^SSYLVGDELWVViyEFLEGGA^ 

NEEQIATVCLSVLRALSYLHNQGVIHRDIKSDSILLTSDGRIKLSDFGFCAQVSKBVPKR 

KSLVGTPYWMAPEVISRLPYGTEVDIWSLGIMVIEMIDGEPPYFNEPPLQAMRRIRDSLP 

PRVKDLHKVSSVliRGFLDLMLWEPSQRATAQELLGHPFLKIiAGPPSC^ 

SEQ ID NO: 235_SURTK106_H 

MNDRNEIQMEAKLQSLTIIAQEILCRFFITLRRHARFLLTKLGRQGMARSGITHSCAVCI 

LCGPSREGDSPVAMGMTRMLLECSLSDKLCVIQEKQYEVIIVPTLLVTIFLILLGVILWL 

FIREQRTQQQRSGPQGIAPVPPPRDLSWEAGHGGNVALPLKETSVENFLGATTPALAKLQ 

VPREQLSEVLEQICSGSCGPIFRANMNTGDPSKPKSVILKALKEPAGLHEVQDFLGRIQF

HQYLGKHKNLVQLEGCCTEKLPLYMVLEDVAQGDLIiGFLWrCRRDVMTMDGLLYDLTEKQ 

VYHIGKQVLLALEFLQEKHLFHGDVAARNILMQSDLTAKLCGLGLAYEVYTRGAISSTQT 

IPLKWLAPERLLLRPASIRADVWSFGILLYEMVTLGAPPYPEVPPTSILEHLQRRKIMKR 

PSSCTHTl^SIMKSCWRWREADRPSPRELRLRLEAAIKTADDEAVLQVPELWPELY^ 

AGIRVESLFYNYSML 

SEQ ID NO: 236_AA098024_M 

LQEKHLFHGDVAARNILIQSDLTPKLCHLGLAYEVHAHGAISSARSSTIPLKWIiAPERLL 
LRPASIRGDIWSFGILLYEMVTLGAPPYPEVPPTSILQYLQRKKIMKRPSSCSHAJVrYN 
KCCWRWSEDSRPLLVQLLQRLEAASRSADDKAVLQVPELWPELYADVAGIRAESISYSF 
SVL 

SEQ ID NO: 237_SGK2ALPHA_H 

MNSSPAGTPSPQPSRANGNIl^PSANPNAQPTDFDFLKVIGKGNYGKVLLAKRKSDGAF 

YAVKVLQKKSILKKKEQSHIMAERSVLLKNVRHPFLVGLRYSFQTPEKLYFVLDY^ 

LFFHLQRERRFLEPRARFYAAEVASAIGYLHSLNIIYRDLKPENILLDCQGHWLTDFGL 

CKEGVEPEDTTSTFCGTPEYLAPEVLRKEPYDRAVDWWCLGAVLYEMLHGLPPFYSQDVS 

QMYENILHQPLQIPGGRTVAACDLLQSLLHKDQRQRLGSKADFLEIKNHVFFSPINWDDL 

YHKRLTPPFNPNVTGPADLKHFDPEFTQEAVSKSIGCTPDTVASSSGASSAFLGFSYAPE 

DDDILDC 

SEQ ID NO: 238_CCRK_H

MDQYCILGRIGEGAHGIVFKAKHVETGEIlALKKVALRRLEDGFPNQALREIKALQEMED

NQYWQLKAWPHGGGFVLAFEFMLSDIAEVVRHAQRPIAQAQA^ 

NNIVHRDLKPANLLISASGQLKIADFGLARVFSPIXSSRLYTHQVATRSVGCIMGEIim 

PLFPGKNDIEQLCYVLRILGTPNPQVWPELTELPDYNKISFKEQVPMPLEEVLPDVSPQA 

LDLLGQFLLYPPHQRIAASKALLHQYFFTAPLPAHPSELPIPQRLGGPAPKAHPGPPHIH 

DFHVDRPLEGVAVEPRADSALHPGGWSWPWSRLPAPQDHSVHLFLCmiPGFTL^GLPMA 

TVGPHHTLPLSPCEGWSRGRGHVPSQEYENIQSSRGDSWPVLGEPYLLCATDVPIRTVSS 

AASQGLHMQNDDACLGAASPECCLLVKEKCRE 

SEQ ID NO: 239_TESK2_H

MDRSKRNSIAGFPPRVERLEEFEGGGGGEGNVSQVGRVWPSSYRALISAFSRLTRLDDFT 
CEKIGSGFFSEVFKVRHRASGQVMALKMNTLSSNRANMLK^QL^^ 
NLEQLLDSNLHLPWTVRVKIiAYDI AVGLS YLHFKGI FHRDLTSKNCLI KRDENGYSAWA 
DFGIJ^KIPDVSMGSEKLAVVGSPFWMAPEVLRDEPYNEKADVFSYGI ILCEI lARIQAD 
PDYLPRTENFGLDYDAFQHMVGDCPPDFLQLTFNCCNMDPKLRPSFVE I GKTLEE I LSRL 
QEEEQERDRKLQPTARGLLEKAPGVKRLSSLDDKIPHKSPCPRRTIWLSRSQSDIFSRKP 
PRTVSVIiDPYYRPRDGAARTPKVNPFSARQDLMGGKIKFFDLPSKSVISLVFDLDAPG 
TMPLADWQEPLAPPIRRWRSLPGSPEFLHQEACPFVGREESLSDGPPPRLSSLKYRVKEI 
PPFRASALPAAQAHEAMDCSILQEENGFGSRPQGTSPCPAGASEEMEVEERPAGSTPATF
STSGIGIiQTQGKQDG

SEQ ID NO: 1_X69117_H BARK2_H

ATGGCGGACCTGGAGGCCGTGCTGGCCGATGTCAGTTACCTGATGGCCATGGAGAAGAGC 

AAGGCGACCCCGGCCGCCCGCGCCAGCAAGAGGATCGTCCTGCCGGAGCCCAGTATCCGG 

AGTGTGATGCAGAAGTACCTTGCAGAGAGAAATGAAATAACCTTTGACAAGATTTTCAAT 

CAGAAAATTGGTTTCTTGCTATTTAAAGATTTTTGTTTGAATGAAATTAATGAAGCTGTA 

CCTCAGGTGAAGTTTTATGAAGAGATAAAGGAATATGAAAAACTTGATAATGAGGAAGAC 

CGCCTTTGCAGAAGTCGACAAATTTATGATGCCTACATCATGAAGGAACTTCTTTCCTGT 

TCACATCCTTTCTCAAAGCAAGCTGTAGAACACGTACAAAGTCATTTATCCAAGAAACAA 

GTGACATCAACTCTTTTTCAGCCATACATAGAAGAAATTTGTGAAAGCCTTCGAGGTGAC 

ATTTTTCAAAAATTTATGGAAAGTGACAAGTTCACTAGATTTTGTCAGTGGAAAAACGTT 

GAATTAAATATCCATTTGACCATGAATGAGTTCAGTGTGCATAGGATTATTGGACGAGGA 

GGATTCGGGGAAGTTTATGGTTGCAGGAAAGCAGACACTGGAAAAATGTATGCAATGAAA 

TGCTTAGATAAGAAGAGGATCAAAATGAAACAAGGAGAAACATTAGCTTTAAATGAAAGA 

ATCATGTTGTCTCTTGTCAGCACAGGAGACTGTCCTTTCATTGTATGTATGACCTATGCC 

TTCCATACCCCAGATAAACTCTGCTTCATCCTGGATCTGATGAACGGGGGCGATTTGCAC 

TACCACCTTTCACAACACGGTGTGTTCTCTGAGAAGGAGATGCGGTTTTATGCCACTGAA 

ATCATTCTGGGTCTGGAACACGTGCACAATCGGTTTGTTGTCTACAGAGATTTGAAGCC^ 

GCAAATATTCTCTTGGATGAACATGGACACGCAAGAATATCAGATCTTGGTCTTGCCTC 

GATTTTTCCAAAAAGAAGCCTCATGCGAGTGTTGGCACCCATGGGTACATGGCTCCCGAG 

GTGCTGCAGAAGGGGACGGCCTATGACAGCAGTGCCGACTGGTTCTCCCTGGGCTGCATG 

CTTTTCT^CTTCTGAGAGGTCACAGCCCTTTCAGACAACATAAA^ 

GAAATTGACCGAATGACACTCACCGTGAATGTGGAACTTCCAGACACCTTCTCTCCTGAA 

CTGAAGTCCCTTTTGGAGGGCTTGCTTCAGCGAGACGTTAGCAAGCGGCTGGGCTGTCAC 

GGAGGCGGCTCACAGGAAGTAAAAGAGCACAGCTTTTTCAAAGGTGTTGACTGGCAGCAT 

GTCTACTTACAAAAGTACCCACCACCCTTGATTCCTCCCCGGGGAGAAGTCAATGCTGCT 

GATGCCTTTGATATTGGCTCATTTGATGAAGAGGATACCAAAGGGATTAAGCTACTTGAT 

TGCGACCAAGAACTCTACAAGAACTTCCCTTTGGTCATCTCTGAACGCTGGCAGCAAGAA 

GTAACGGAAACAGTTTATGAAGCAGTAAATGCAGACACAGATAAAATCGAGGCCAGGAAG 

AGAGCTAAAAATAAGCAACTTGGCCACGAAGAAGATTACGCTCTGGGGAAGGACTGTATT 

ATGCACGGGTACATGCTGAAACTGGGAAACCCATTTCTGACTCAGTGGCAGCGTCGCTAT 

TTTTACCTCTTTCCAAATAGACTTGAATGGAGAGGAGAGGGAGAGTCCCGGCAAAATTTA 

CTGACAATGGAACAGATTCTCTCTGTGGAAGAAACTCAAATTAAAGACAAAAAATGCATT 

TTGTTCAGAATAAAAGGAGGGAAACAATTTGTCTTGCAATGTGAGAGTGATCCAGAGTTT 

GTGCAGTGGAAGAAAGAGTTGAACGAAACCITCAAGGAGGCCCAGCGGCTA 

GCCCCGAAGTTCCTCAACAAACCTCGGTCAGGTACTGTGGAGCTCCCAAAGCCATCCCTC 

TGTCACAGGAACAGCAACGGCCTCTGA 

SEQ ID NO: 2_AA144574_M BARK2_M 

CTGCTTCGTAGTCTACAGAGACCTGAAGCCTGCGAACATCCTCCTAGATGAATATGGGCA 
CGTGAGGATATCGGATCTCGGCCTTGCCTGTGATTTCTCCAAAAAGAAGCCTCATGCCAG 
CGTGGGCACCCATGGGTACATGGCTCCCGAGGTGTTGCAGAAGGGAACGTGCTATGACAG 
CAGCGCCGACTGGTTCTCCCTGGGCTGTATGCTCTTCAAACTTCTGCGGGGCCACAGCCC 
CTTCAGGCAGCATAAAACCAAAGACAAGCATGAGATAGACCGAATGACCCTGACCGTGAA 
CGTGCAGCTTCCAGATGCCTTCTCCCCTGAGCTGAGGTCCCTCTTAGAGGGTTTGCTCCA 
GCGGGACGTGAGCCAGCGGCTGGGCTGCGGAGGAGGAGGGGCACGAGAGTTGAAGGAGCA 
CATCTTCTTCAAGGGCATTGACTGGCAGCATGTGTACTTACGGAAGTACCCGCCACCCCT 
AATCCCTCCTCGGGGAGAGGTCAACGCTGCAGATGCCTTCGATATCGGCTCCTTCGATGA 
GGAAGACACCAAAGGCATTAAGCTGTTGGACTGTGACCAGGACCTCTATAAGAACTTCCC 
ACTGGTGATCTCCGAGCGCTGGCAGCAAGAAGTGGTGGAGACCATCTATGACGCCGTCAA 
TGCTGATACTGATAAAATCGAGGCCAGGAAGAAGGCTAAAAATAAGCAACTTGGTCAAGA 
GGAAGATTACGCTATGGGGAAGGACTGCATCATGCACGGGTACATGCTGAAGCTGGGGAA

CCCCTTTCTCACACAGTGGCAAAGACGCTATTTTTACCTGTTCCCCAACAGAC 

GAGAGGAGAGGGCGAGTCTCGGCAAAGTCTACTGACCATGGAACAGATCATGTCTGTGGA 

GGAGACCCAGATTAAAGACAGAAAGTGCATCTTACTCAGGATAAAGGGAGGGAACklAATT 

TGTCTTGCAATGTGAGAGTGACCCCGAGTTTGCACAGTGGCTGAAGGAGCTGACCTGCAC 

CTTCAATGAGGCCCAGAGACTGCTGCGCCGTGCCCCCAAATTCCTCAACAAACCACGGGC 

CGCCATCCTGGAGTTCTCCAAGCCACCACTGTGTCACAGAAATAGCAGCGGCCTCTGAAC 

CACAGAGCAGCGGGGCCTGAAGGAGGGGCCCCAGCTCTTCAGCCCJ^GGAGTGGAACG^ 

CCACGGGGAACCGTGTGGGGCTAAGACACAGTGTTTCTGAGCACTGACGGGGCTGCTCCA 

AGCCGAGGAGGCTCAGGACACCAGGGCGGCCTTCTGGGAGCTGGGACATCCTCGGGGCTG 

TCCTATCCACACTCGAAATTACTGAAGAAGCAGAGGCATTCTGCTGTG 

SEQ ID NO: 3_AA826850_H 

GAAGAGGATGGGCTCGTCCATGTCGGCGGCCACCGCGCGGAGGCCGGTGTTTGACGACAA 

GGAGGACGTGAACTTCGACCACTTCC^GATCCTTCGGGCCTVTTGGGAAGGGCAGCTTTGG 

CAAGGTGTGCATTGTGCAGAAGCGGGACACGGAGAAGATGTACGCCATGAAGTACAT(^ 

CAAGCAGCAGTGCATCGAGCGCGACGAGGTCCGCAACGTCTTCCGGGAGCTGGAGATCCT 

GCAGGAGATCGAGCACGTCTTCCTGGTGAACCTCTGGTACTCCTTCCAGGACGAGGAGGA 

CT^TGTTCATGGTCGTGGACCTGCTACTGGGCGGGGACCTGCGCTACCACCTGCAGCAGAA 

CGTGCAGTTCTCCGAGGACACGGTGAGGCTGTACATCTGCGAGATGGCACTGGCTCTGGA 

CTACCTGCGCGGCCAGCACATCATCCACAGAGATGTCAAGCCTGACAACATTCTCCTGGA 

TGAGAGAGGACATGCACACCTGACCGACTTCAACATTGCCACCATCyiTC^ 

GCGGGCGACGGCATTAGCAGGCACCAAGCCGTACATGGCTCCGGAGATCTTCCAXTCTTT 

TGTCAACGGCGGGACCGGCTACTCCTTCGAGGTGGACTGGTGGTCGGTGGGGGTGATGGC 

CTATGAGCTGCTGCGAGGATGGAGGCCCTATGACATCCACTCCAGCAACGCCGTGGAGTC 

CCTGGTGCAGCTGTTCAGCACCGTGAGCGTCCAGTATGTCCCCACGTGGTCCAAGGAGAT 

GGTGGCCTTGCTGCGGAAGCTCCTCACTGTGAACCCCGAGCACCGGCTCTCCAGCCTCCA 

GGACGTGGAGGCAGCCCCGGCGCTGGCCGGCGTGCTGTGGGACCACCTGAGCGAGAAGAG 

GGTGGAGCCGGGCTTCGTGCCCAACAAAGGCCGTCTGCACTGCGACCCCACCTTTGAGCT 

GGAGGAGATGATCCTGGAGTCCAGGCCCCTGCACAAGAAGAAGAAGCGCCrGGCC^ 

CAAGTCCCGGGACAACTIGCAGGGACAGCTCCCAGTCCGAGAATGACTATCTTCTV^ 

CCTCGATGCCATCCAGCAAGACTTCGTGATTTTTAACTVGAGAAAAGCTGAAGAGGAGCCA 

GGACCTCCCGAGGGAGCCTCTCCCCGCCCCTGAGTCCAGGGATGCTGCGGAGCCTGTGGA 

GGACGAGGCGGAACGCTCCGCCCTGCCCATGTGCGGCCCCATTTGCCCCTCGGCCGGGAG 

CGGCTAGGCCGGGATGCCCGTGGTCCTCACCCCTTGAGCTGCTTTGGAGACTCGGCTGCC 

AGAGGGAGGGCCTVTGGGCCGAGGCCTGGCATTCACGTTCCCACCCAGCCTGGCTGGC^ 

GCCCACAGTGCCCCGGACACATTTCACACCTCAGGCTCGTGGTGGTGCAGGGGAC^ 

GCTGTGGGTGCAGGGGACACCTGTGGAGGGCATTTCCCGTGGGCCCCCGAGACCCGCCTA 

GATGGAGGAAGCGCTGCTGGGCGCCCTCTTACCGCTCACGGGGAGCTGGGGCCATGGATG 

GGACAGGAGTCTTTGTCCCTGCTCAGCCCGGAGGCTGTGCACGGCCCTCGTCACAAGGTG 

ACCCITGCAGCACAGGCCGCGGGTGCCCCAGGCTCGGCTCAGTTCTTGGAGGTCAAC^ 

ATGGGTTGGGGTAGTGGGTGGGGAGGTGAATGTTTTCTAGAGATTCAAACTGCTCCAGCA 

ATTTCTGTATAGTTTTCACCTCTGAGAATTACAATGTGAGAACCGCTC 

SEQ ID NO: 4_AA960957_H 

GTCCCACATCCCGCATCCGGCATCCCAGCGGCCGGGCATGTAGCAGCGGCAGCAACGGCG 
GAATATGGGCGGGAACCACTCCCACAAGCCCCCCGTGTTTGACGAGAATGAGGAAGTCAA 
CTTTGACCATTTTCAGATTCTGCGGGCCATTGGTAAAGGGAGTTTTGGAAAGGTATGCAT 
CGTGCAGAAGCGAGACACTAAGAAAATGTATGCAATGAAGTACATGAACAAGCAGAAGTG 
CATCGAGAGGGATGAGGTTCGGAATGTTTTCCGGGAGCTGCAGATCATGCAAGGGCTGGA 




GCACCCCTTCCTGGTCAATCTGTGGTACTCCTTCCAGGATGAGGAGGACATGTTCATGGT 

GGTGGACCTQCTCCTGGGAGGCGACCTGCGCTACCATCTGCAGCAGAATGTGCATTTCAC 

AGAGGGGACTGTGAAACTCTACATCTGTGAGCTGGCACTGGCCCTGGAGTATCTTCAGAG 

GTACCACATCT^TCCACAGAGACATCAAGCCAGACAATATCCTGCTGGATGAACACGGACA 

TGTTCACATTACAGACTTCAACATAGCGACGGTAGTGAAAGGAGCAGAAAGGGCTTCCTC 

CATGGCTGGCACCAAGCCCTACATGGCTCCAGAAGTATTCCAGGTGTACATGGACAGAGG 

CCCCGGATACTCGTACCCTGTCGACTGGTGGTCCCTGGGCATCACAGCCTATGAGCTGCT 

GCGGGGCTGGAGGCCGTACGAAATCCACTCGGTCACGCCCATCGATGAAATCCTCAACAT 

GTTCAAGGTGGAGCGTGTCCACTACTCCTCCACGTGGTGCAAGGGGATGGTGGCCCTGCT 

GAGGAAGCTCCTGACCAAGGATCCTGAGAGCCGCGTGTCCAGCCTTCATGACATACAGAG 

CGTGCCCTACTTGGCCGACATGAACTGGGACGCGGTGTTCAAGAAGGCACTGATGCCCGG 

CTTTGTGCCCAATAAAGGGAGGTTGAACTGCGATCCCACTITTTGAGCTTGAAGAGATGAT 

TCTAGAATCCAAGCCACTTCACAAAAAGAAGAAGCGATTGGCAAAGAACAGATCCAGGGA 

TGGCACAAAGGACAGCTGCCCGCTGAATGGACACCTGCT^GCACTGTTTGGAGACTGTCCG 

GGAGGT^TTCATCATATTCAACAGAGAGAAGCTCAGGAGGCAGCAGGGACAQ 

GCTCTTGGACACCGACAGCCGAGGGGGAGGCCAGGCCCAAAGCAAGCTCCAGGACGGG 

CAACAACAACCTCCTCACCCACACCTGCACCCGTGGCTGCAGCAGCTGAGCCCACACTTG 

TTGCTGCTCAACAGGACTGCACTCGTCTCTGCCCTGCCCACCCAGAGCCCCTCTTTGTGC 

CCTGATGGTCCCTGTCTCACCCCTGAAAACATCAGATGCAGAAAAAGCCCTGGACTTGGA 

GCTGGGAAGCCTGGGTTCTGGTCCCATCTCCATGACTGATTCACGTGTGACCTCAGACAA 

GTCACGCCCTCTCTGTGCCTCCGTTTTCTGCATCTGCCAAAGGGGTTAAACACTTCTGCC 

CCACTTCAAATTAGAAGATTATGGGGAGAACCCAATTAGGTAGGAAACATGAAAAACCTT 

TGATATTTATAAAATCATTTTTACGTGCAAAATATAACCTTAATATTTGAAGTGACCCCC 

ATTCCCCAAAGCAATCAAACCGTCATGACTTTGCAATTTGGCACATCCTAGCTTGTTAGA 

GGGCACTTCCGAAAAACACAGCCCTGACAGCAAAATAAAGGTCTGATATGTTGGCCCCTT 

CTATGGAAACAACGCTGCCAAATCCTGGAGCAAAACCTGAAGTGTCTTCATGTGCATTCT 

CTGGCAGGCCACAGTCCTGAGCTTGTAAGATGGTGCAGCATGCAGACCAGACTTGTCCCC 

AAGGTCTCAGCGCTGCGGTCTCACTCCTCCCCTCATTTAAGAAGACTATCCTTACCTTTT 

AGTTTCAGCAGTCCTCACCACCACCATATCCCCAGTGCTGGGATGGCACACAGGTGTCCA 

TTCAGATGAGAGTTGGGTCGCTGAGCATTGGTTACTCCTG(ZAGAGTGTAATCAGCACCCC 

ATCCAACTGGCCCGAAAGCCCAGACCTGCAGCAGAACTCTCCAACTCTCTATCAGCTTTC 

AGGGTTTTCTCTCCTGGGAAGGGTGTAAAATCAGCTTGTCAGATTCTTCTTACAGAGAGT 

ATCCAATCGGTATTGGTGGAGCGGCTCCCTATTTATACAATAGGAAGCATGGGTGCTTAG 

AAAGTTTATTTCAGGAGGAAAATGGGTTCACACAAAAAGCAAACTACATTCTGATCTGCT 

CAGGGAGAAGCTTGCCTTTGAACTGGAAGATGTTGGGATGAGCAGGGAAAGCTTAGACTT 

TGGAGTCAGGTTTGTGTTCAGAATCCAGCCCTGCTGGCTACTAACTAACTGGGAGACCTT 

AGGCAAAGCATGCAATCGCTCTGAATGGCAGTTTCCTCATTTTTAAACAGGGATAATAAA 

ACTAATATTGCAGGGGAGTTACAGGGTTAAATAAGATCCTGTGTGTAACCCCAAGCATTG 

GATGACTCATAGAATGGCCTTTTTTGTCAGCATAATCGTCATCATTATTTAGATACTTTC 

TTCCTTCACTCACCCAGCAGGTCAGTTTTCTGTGC?y^CAAACCTGTTTAGGATTCT^ 

AAATGTTCTTCCTGGGGTCTTTGATATTTGTTTGTTACATCCTGCTGAAGTTCGACTGTG 

TTTTTATTTTTTCATCCAACTTCCATTTTTCACTTTTTACATGATTACTCAATCCTTGGG 

GCTGTCCATGTCATCTCTTAGATTTCTTAAAAGACATTTTAATGTATGGTTAGGTTTTAT 

ATTTTTATTTTTTAAAAAAGAAATAGTCAGTGTTTTCCTCCTTTCAACCGAGACTATTTC 

TGGATTGTGTGCTCCTCGTCAGTTGACTTGTTTTGCACACTTTTCTTTACTTCATGTCCC 

CATCAACAACCGTCCTGCTCCCCACCTCCCCCAGGAAATAAGGGGCCTGCTCCTCTCCCT 

ACTGTGACCCTGGAGGCTCTTAAGATGATGATGGTTTTTTTTATTGGGCTGAGTTCACGA 

ATTAGGGGCAGGAGCTGGAAGTCGCCCTAGGAACACCAGATTTCCTGGTTCTGTTCAAGT 

TGGCATTTCTTGTTTGGAATAAACTATTTCTTGGACATTCCTTC

SEQ ID NO: 5_TBK1_H

TCCTGAGTCTCGAGGAGGCCGCGGGAGCCCGCCGGCGGTGGCGCGGCGGAGACCCGGCTG 

GTATAACAAGAGGATTGCCTGATCCAGCCAAGATGCAGAGCACTTCTAATCATCTGTGGC 

TTTTATCTGATATTTTAGGCCAAGGAGCTACTGCAAATGTCTTTCGTGGAAGACATAAGA 

AAACTGGTGATTTATTTGCTATCAAAGTATTTAATAACATAAGCTTCCTTCGTCCAGTGG 

ATGTTCAAATGAGAGAATTTGAAGTGTTGAAAAAACTCAATCACAAAAATATTGTCAAAT 

TATTTGCTATTGAAGAGGAGACAACAACAAGACATAAAGTACTTATTArGGAATTTTGTC 

CATGTGGGAGTTTATACACTGTTTTAGAAGAACCTTCTAATGCCTATGGACTACCAGAAT 

CTGAATTCTTAATTGTTTTGCGAGATGTGGTGGGTGGAATGAATCATCTACGAGAGAATG 

GTATAGTGCACCGTGATATCAAGCC7VGGAAATATCATGCGTGTTATAGGGGAAGATGGAC 

AGTCTGTGTACAAACTCACAGATTTTGGTGCAGCTAGAGAATTAGAAGATGATGAGCAGT 

TTGTTTCTCTGTATGGCACAGAAGAATATTTGCAGCCTGATATGTATGAGAGAGCAGTGC 

TAAGAAAAGATCATCAGAAGAAATATGGAGCAACAGTTGATCTTTGGAGCATTGGGG 

CATTTTACCATGC7VGCTACTGGATCACTGCCATTTAGACCCTTTGAAGGGCCTCGTAGGA 

ATAAAGAAGTGATGTATAAAATAATTACAGGAAAGCCTTCTGGTGCAATATCTGGAGTAC 

AGAAAGCAGAAAATGGACCAATTGACTGGAGTGGAGACATGCCTGTTTCITGC^ 

CTCGGGGTCTTCAGGTTCTACTTACCCCTGTTCTTGCAAACATCCTTGAAGCAGATCAGG 

AAAAGTGTTGGGGTTTTGACCAGTTTTTTGCAGAAACTAGTGATATACTTCACCGAATGG 

TAATTCATGTTTTTTCGCTAGAACAAATGACAGCTCATAAGATTTATATTC^ 

ATACTGCTACTATATTTCATGAACTGGTATATAAACAAACCAAAATTATTTCTTCAAATC 

AAGAACTTATCTACGAAGGGCGACGCTTAGTCTTAGAACCTGGAAGGCTGGCACAACATT 

TCCCTAAAACTACTGAGGAAAACCCTATATTTGTAGTAAGCCGGGAACCTCTGAATACCA 

TAGGATTAATATATGAAAAAATTTCCCTCCCTAAAGTACATCCACGTTATGATTTAGACG 

GGGATGCTAGCATGGCTAAGGCAATAACAGGGGTTGTGTGTTATGCCTGCAGAATTGCCA 

GTACCTTACTGCTTTATCAGGAATTAATGCGAAAGGGGATACGATGGCTGATTGAATTAA 

TTAAAGATGATTACAATGAAACTGTTCACAAAAAGACAGAAGTTGTGATCACATTGGATT 

TCTGTATCAGAAACATTGAAAAAACTGTGAAAGTATATGAAAAGTTGATGAAGATCAACC 

TGGAAGCGGCAGAGTTAGGTGAAATTTCAGACATACACACCAAATTGTTGAGACTTTCCA 

GTTCTCAGGGAACAATAGAAACCAGTCTTCAGGATATCGACAGCAGATTATCTCCAGGTG 

GATCACTGGCAGACGCATGGGCACATCAAGAAGGCACTCATCCGAAAGACAGAAATGTAG 

AAAAACTACAAGTCCTGTTAAATTGCATGACAGAGATTTACTATCAGTTCAAAAAAGACA 

AAGCAGAACGTAGATTAGCTTATAATGAAGAACAAATCCACAAATTTGATAAGCAAAAAC 

TGTATTACCATGCCACAAAAGCTATGACGCACTTTACAGATGAATGTGTTAAAAAGTATG 

AGGCATTTTTGAATAAGTCAGAAGAATGGATAAGAAAGATGCTTCATCTTAGGAAACAGT 

TATTATCGCTGACTAATCAGTGTTTTGATATTGAAGAAGAAGTATCAAAATATCAAGAAT 

ATACTAATGAGTTACAAGAAACTCTGCCTCAGAAAATGTTTACAGCTTCCAGra 

AACATACCATGACCCCAATTTATCCAAGTTCTAACACATTAGTAGAAATGACTCTTGGTA 

TGAAGAAATTAAAGGAAGAGATGGAAGGGGTGGTTAAAGAACTTGCTGAAAATAACCACA 

TTTTAGAAAGGTTTGGCTCTTTAACCATGGATGGTGGCCTTCGCT^CGTTGACTGTCTTT 

AGCTTTCTAATAGAAGTTTAAGAAAAGTTTCCGTTTGCACAAGAAAATAACGCTTGGGCA 

TTAAATGAATGCCTTTATAGATAGTCACTTGTTTCTACAATCCAGTATTTGATGTGGTCG 

TGTAAATATGTACAATATTGTAAATACATAAAAAATATACAAATTTTTGGCTGCTGTGAA 

GATGTAATTTTATCTTTTAACATTTATAATTATATGAGGAAATTTGACCTCAGTGATCAC 

GAGT^GAAAGCCATGACCGACCAATATGTTGACATACTGATCCTCTACTCTGAGTGGGGC 

TAAATAAGTTATTTTCTCTGACCGCCTACTGGAAATATTTTTAAGTGGAACCAAAATAGG 

CATCCTTACAAATCAGGAAGACTGACTTGACACGTTTGTAAATGGTAGAACGGTGGCTAC 

TGTGAGTGGGGAGCAGAACCGCACCACTGTTATACTGGGATAACAATTTTTTTGAGAAGG 

ATAAAGTGGCATTATTTTATTTTACAAGGTGCCCAGATCCCAGTTATCCTTGTATCCATG 

TAATTTCAGATGAATTATTAAGCAAACATTTTAAAGTGAATTCATTATTAAAAACTATTC 

ATTTTTTTCCTTTGGCCATAAATGTGTAATTGTCATTAAAATTCTAAGGTCATTTCAACT 
GTTTTAAGCTGTATATTTCTTTAATTCTGCTTACTATTTCATGGAAAAAAATAAATTTCT
CAATTTTAAAAAA 

SEQ ID NO: 6_AA305176_H 

TGGCTGCTCGCGGAGGGGCAGTGTACGCGGGGCCGCTGTAGGCTGTCCAGCGATGGATCC 

CACCGCGGGAAGCAAGAAGGAGCCTGGAGGAGGCGCGGCGACTGAGGAGGGCGTGAATAG 

GATCGCAGTGCCAAAACCGCCCTCCATTGAGGAATTCAGCATAGTGAAGCCCATTAGCCG 

GGGCGCCTTCGGGAAAGTGTATCTGGGGCAGAAAGGCGGCAAATTGTATGCAGTAAAGGT 

TGTTAAAAAAGCAGACATGATCAACAAAAATATGACTCATCAGGTCCAAGCTGAGAGAGA 

TGCACTGGCACTAAGCAAAAGCCCATTCJ^TTGTCCATTTGTATTATTCACTGCAGTCTG 

AAACAATGTCTACTTGGTAATGGAATATCTTATTGGGGGAGATGTCAAGTCTCTCCTACA 

TATATATGGTTATTTTGATGAAGAGATGGCTGTGAAATATATTTCTGAAGTAGCACTGGC 

TCTAGACTACCTTCACAGACATGGAATCATCCTVCAGGGACTTGAAAC 

TATTTCTAATCAGGGTCATATTAAACTGACGGATTTTGGCCTTTCAAAAGTTACT^ 

TAGAGATATTAATATGATGGATATCCTTACAACACCATCTVATGGCT^AAACCT^ 

TTATTCAAGAACCCCAGGACAAGTGTTATCGCTTATCAGCTCGTTGGGATTTAACACACC 

AATTGCAGAAAAAAATCAAGACCCTGCT^CATCCTTTCAGCCTGTCTGTCTGAAACATC 

ACAGCTTTCTCAAGGACTCGTATGCCCTATGTCTGTAGATCT^AAAGGACACTACGCCTTA 

TTCTAGCAAATTACTAAAATCATGTCTTGAAACAGTTGCCTCCAACCCAGGAATGCCTGT 

GAAGTGTCTAACTTCTAATTTACTCCAGTCTAGGAAAAGGCTGGCC7VCATCCAGTC 

TAGTCAATCCCACACCTTCATATCCAGTGTGGAATCAGAATGCCACAGCAGTCCCAAATG 

GGAAAAAGATTGCCAGGTTTGAGGGACATTTATCTTAATGAAAATCAATTATGTATGTCA 

AATGAATGTGAGAAATATTATACCTTTTCATATAAATTCCATAAAGAAATGAAATTGTTA 

CATGAATGGCAGTCATAGTATTAATCAGAAATTCATTTTCCTGCACATTCTGTCAAATT^ 

TTTTGAAATATTTCATTTCTCATTCAATTGTGACATTGTTCTTACTTGATTATATAAT^ 

GATTCTTGCAGTAAATTGATAATAAATGCTTGGCTTCTGTGTATCTAGGTGGACCTCACT 

TGTTTTTAGAAGTCCTTCCCATGATACAGACATTGGCTTGTTGGTTTTGTTTTATTTTGT 

TTTTAACATATGTCATTTAAAAACTCATATTACCTCCTTTT 

SEQ ID NO: 7_AA116841_M 

CCACGCGTCCGATCCCATGGCCAGAAGGCGAAGAAAAGCTATCTGATAATGCTCAAAGTG 

CAATGGACATGCTTTTAACCATTGATGATTCAAAGAGAGCTGGAATGAGAGAACTAAA^ 

AGCATCCTCTCTTCAGTGAAGTGGACTGGGAAAATCTGCAGCTITCT^GACTATGCCT 

TACCCCAACCAGACGACGAAACAGATACATCCTATTTTGAAGCCAGAAATAATGCTCAA^ 

ATCTGACCGTATCTGGGTTTAGTCTGTAGCACATGCGTGTCATTTTTATCTAACTTGTGA 

TATAGAATTAAGTTTTACAGTAATATGCTACTTAATACTAGATTGGTCTAAATGGGATAA 

AAGTCATTATTTTACCCAGACTGAACAGCTTTTAATTACTAAGTACAACAGTTTTTACAG 

AATTAAAATACTATAAGCAATATAATCAGTAATTAATCTTTACCTTAGAACTGTATATAA 

GCCATAATAGCTTTTTTCTVTCTTATTTATTCACTGCACTTTATGAAGAGCAAAGTATC^ 

TAAACTAAAACACTACCACTCTAAATAGAGGGAGTGAGCCGT 

SEQ ID NO: 8_AA256100_H 

AGGGAGCTGACGGGCGCCCGGCCGGCTGCGGTCCGTGCGGAGGCTGAGCCGGCCGCGGGC 
GCGACCGGAGGCAGTTTCCGTTACTATGGCAATGACGGCAGGGACTACAACAACCTTTCC 
TATGAGCAACCATACCCGGGAAAGAGTGACTGTAGCCAAGCTCACATTGGAGAATTTTTA 
TAGCAACCTAATTTTACAGCT^TGAAGAGAGAGAAACCAGGCAGAAGAAATTAGAAGTGGC 
CATGGAAGAAGAAGGATTAGCAGATGAAGAGAAAAAGTTACGTCGATCACAACACGCTCG 
CAAAGAAACAGAGTTCTTACGGCTCAAAAGGACCAGACTTGGCTTGGATGACTTTGAGTC 
TCTGAAAGTTATAGGAAGAGGAGCTTTTGGAGAGGTGCGGTTGGTCCAGAAGAAAGATAC 
AGGCCATATCTATGCAATGAAGATATTGAGAAAGTCTGATATGCTTGAAT^AAGAGCAGGT 
GGCCCATATCCGAGCAGAAAGAGATATTTTGGTAGAAGCAGATGGTGCCTGGGTGGTGAA

GATGTTTTACAGTTTTCAGGATAAGAGGAATCTTTATCTAATCATGGAATTTCTCCCTGG 

AGGTGACATGATGACATTGCTAATGAAGAAAGACACCTTGACAGAAGAGGAAAC^CAGT^ 

CTACATTTCAGAGACTGTTCTGGCAATAGATGCGATCCACCAGTTGGGTTTCATCCATCG 

GGATATTAAGCCAGACAACCTTTTATTGGATGCCAAGGGTCATGTAAAATTATCTGATTT 

TGGTTTATGTACGGGATTAAAGAAAGCTCACAGGACTGAATTTTATAGAAATCTCACACA 

CAACCCACCAAGTGACTTCTCATTTCAGAACATGAACTCAAAGAGGAAAGCAGAAACTT^ 

GAAGAAGAACAGGAGACAACTGGCATATTCCACAGTTGGGACACCAGATTACATTGCTCC 

AGAAGTATTCATGCAGACTGGTTACAACAAATTGTGTGACTGGTGGTCTTTGGGAGTGAT 

TATGTATGAAATGCTAATAGGATATCCACCTTTCTGCTCTGAAACACCTCAAGAAACATA 

CAGAAAAGTGATGAACTGGAAAGAAACTCTGGTATTTCCTCCAGAGGTACCTATATCTGA 

GAAAGCCAAGGACTTAATTCTCAGATTTTGTATTGATTCTGAAAACAGAATTGGAAATAG 

TGGAGTAGAAGAAATAAAAGGTCATCCCTTTTTTGAAGGTGTCGACTGGGAGCACATAAG 

GGAAAGGCCAGCAGCAATCCCTATAGAAATCTkAAAGCATTGATGATACTTCAAATTTTGA 

TGACTTCCCTGAATCTGATATTTTACAACCAGTGCCAAATACCACAGAACCGGA 

ATCCATUIGACTGGGTTTTTCTCAATTATACCTATAAAAGGTTTGAAGGGTTGACTCAACG 

TGGCTCTATCCCCACCTACATGAAAGCTGGGAAGTTATGAATGAAGATAACATTCACCCA 

TAACCAAGAGAACTCTVGGTAGCTGCATCACCAGGCTTGCTTGGCGTAGATAACT^TA^ 

TGAAATACTCCTGAAGATGGTGGTGCTTATTGACTACAAGAGGAAATTCTACAGGATTAG 

GATTTCTAAGACTACTATAGGAATTGGTTGGCAGTGCCAGCTGGCTCTTTTTTTTAATAT 

TTTATTATTTTTGTTAACTTTATTATATGAAGGTACTGGAATAAAAGGAACAGACATCCC 

TTTCTAACTGCACTGCCTACATGCGTATTAAGGTCCATTCTGCCTGTGTGTGCTGTGGCT 

TTGAACTGTAACACCTCTAATCAATTCAGGAGAAACTVCATATCATTTAAAGCT^CATA^ 

CTAACCTGTAGGTAACACTGCAGTATTGATGTTTTACTGCAAATCTTATGGGTCTAGATA 

ATCAGTAAAAGCCATCTTCCATAGTTGGTGTTAGAACATTGCCCTATTGGTTTGGACATC 

TGTAGAATATATATGAAGACAATTTCTGTAATGGTTTTAAGAGATTTAAAAAGAAATTCA 

CTGGTTCTTTACAAAATAGAATTTATCATCAAGTTATTACACAAACTTCACAGTAAGGAG 

TGACAAGTTTATAATAAGGAAGACAAAGTTTAACACCTTCACTCAAGCACTCCACTAATA 

TATTTACGTTGCATTCAGAAATACTGATGACCTTCATATACGTAGTCTGTATACTCATAG 

GGAGATGTACTGTATTATATAACATGTAAAGTTGATTTTCTTGTGACAAGAGAACTTCTT 

TTTTTAACTU^GAGGACATGGCATTATTTTAATTTGATTATGGTGAGTTGAATTTAAGAC^ 

TGACCATGAAGGCTGCTTGTAGAATTAGTGTATTTTTATTAAACTATTTTTTTAAATGTC 

AAACTTCTATCATGTAAATGGACTTATAGAGAACAAAAAGCTATTTACTTTGGTTTTCTA 

GAAAGTTGTTACATATCATGGCTGGTTAACTTTTATTTCTTTTGATGAAAATTTTTCCTT 

TGATAGTACTTGTATTATTGTGCCATTATTTTCTTATGCTCCAAATGTACCAAAGATCTT 

gaacAgagtggatgttcticaactgagtagaattttcctttcctgtgggcatgctgtact 
agacctgacagatctttgatagaggtcagcttattaaagggcaatattgttcttgtttag 

CTAC71TC?VCTGTGGTGAATATAGATGGAATTAAGGAAGTAAATGCAGGCCAGGGGGTTGT 

gatgagaggataggggagataatatcagcatcaaattctttgggtatctctctaagaatt 
aaataatcttttctagcttaatattttaattctaattcaaacaactctgaggttttggtt 

TCATTAGTAATAGTTGAGGAATAATATACTAGCAAAGAATGGCCTAATGTTTGTCATAAC 

tgttaatggatgaaattttttaaagatacaaccatgataaccattataaatgatctatga 
tcaaaatctaaagtgatgaattatttgtaggaatgtcttcctaatggggaagaattgcat 

AGGAGCATTATGCT^TCTACACAAGCTTTTATAAATGTTGCTGCTGGGTAGCTCCAC^ 

TGTTTCATAAGGCCATCCTGTTTCCCCCAACTCCCCCATTTTTGGTTTGTTTCTTTTTAA 

ATATTTGTTGAGTACTTACGTGTTTATCTAACy^GTTCACTTCCATTTTTCTAGTCTGGAT 

TTTTTGAGTATTTAGGAAAGAGAGCTATTAAAAACTCTGGGGATTTCTCAATGTGACTAA 

CTCTAATTTTTCTAATTATAACTGCCTTTAATTAACATAATATTAACTTTTGCTGAGGTT 

TATGAGATTTTCTCACCCCACATCGCTCCCCTTTTTTTAA/^GGACTGTTTTGCTAGTG 

TGATAATGAATAGGTAAGATATGAGATAATTGCAACATTGTCCTAGTTCTAGTATGGTAA 
CTATTCTTGAAATGGTATTGAAAAATACCGTTAATTCAAATTGACAGAGATTGATAAAAA
GAAACTGATTTACCTAAGTTTACTTTTTAATTGCATAATAGAGCATTTTTTGTTTTGAGT 
TCCCTCATTCTTATTACCAGAAAGAGCTTGCA^TAGTTTTACTTTCTTGGCACTGGAAG 
GGTAGTTCTGGAAAGCTACTTTGTTGAGAGTCTCATTCTTCCCTGGAGTTAATAGAGTGA 
TTCACAATCTTTGGGGTTTTCTCCTCATCAAAAGCATTTCTTAAGTGCCTATCTAAAAGC 
AATTAAAGACTGTGTCTGCCCTTTAGAAGCTAAGAATTTGATTCATGATGCAAATTAACT 
AGATAATTTGCAT^GTACCCTTGAGATTGAATTTTCTCTATTATATATTTCCCATATTTC 
AGGTGAATAATTTAATTTAAATGACAAAACCCTATCTAGTCAACTGGGCATAATGACATT 
TTCTTTAAATTAGACTCTATTTTGAATTAAAAGAGTTTTATTATAAACCGTGTGTTTTTG 
GTTTTTCTAAGTATATAGAAAGCTTGTATAATTCAGATTTATCAATTTCCTGATTTAATG 
TAGACTTTGACTTTTTTATTAAAAACCTTTGTATTAAAGCAAGTTATGTTATTTTTCTTT 
TATGCATTTATTACTAACATAGCTTTAAATCTTTAAATGTATTGAAGCATTGTGCTGTCT 
GAAAATAAGGAATTGCTTATAAACCAGCCACTTCTGAATACAATATGTAGCTGATTTAAT 
AAGCTAGTTAGTGAATGGAAAATAAGTGTGGAGTATTAAAAATGTTCTTTGGTTGGTAAG 
GCCTAAGATAGGGTTTCATTTATTTCTATACTTTTTCTGTTTTTTAAACACCTGCATATT 
TTTATGTAAATCTCTAAATTTAAAATATTTTAAGTACATTTATTTTTGGTGTTTTATTGT 
ATAAAACCTTAGACAATCAATCAGTCAGTCTTTACTGACT^GGAGCTVGCAGCTATCTGTCT 
TTTGCTGATCTACAAATAAATGAATTGAGAATTTAGTCCATAGAGGTCCCTGGCTACCAA 
ACTICATTCTCCTTTGAATTGTTAAAATTCAGAACATTCAAAATAACTGTTTTGCTACAAC 
CCATGATTATTTTCCTGTTGTGTTTATTTAAATTTACTTTCTCTTTAGAAGTGCACTTAT 
TTGTGAAAAATGTTAATGAAACAAACGCTTAGAACAAATATAAATATGAGACACTTGGGA 
CTACTAGAGATATTTTAGATTTTTATGAAAAAAATGTGAGGGGATATTGCTGCTTTAAAA 

TGTCTAGAAAAGTAATCATGAGGCTACTGAGTTTGGTGTTCAGTTACTGAGTTTCAAAAA 
TGTTTTGGTGGCATGAGGACAAAATTTCATTGAAGGTAAGATAAGAATAAAAACTATGTT 
TAG 

SEQ ID NO: 9_AA210825_H 

CACGAGGGCTACTGGCGCCTGGCGACCCTCCCTGCCCCCCACCCAACCCCGCTCCGGCAA 
CGCCCCCTTCCTCACGGCTCCCGACCGAACTTTTCTCCAACTTCTGCGACTCGTGAGATT 
CCCTTCTACCCACTCCGGCCCTCGGGACCCCTCTGCCCATCCCCTGGCCGGTCGGGTCCC 
TGCGAACCCCTTTATCTCTGGAATCCACTCGGTCCCCGACTCAGAGACTCCTGCCCTCCA 
CCCCC7VAGGACCCCGCCATCCTCAGGTCCCCTCCGCCTGCCAGATCTTTTCTCGGATCCC 
CGCTCTCCCACCACCTGCTCACGAGATCCCGCGGATCTAGAACCCAGGGTCCCCCGGGGC 
CCCCCGGCGGGTCCCGGGTGGGCTCCAGGCGGGCGGTCCCCGGCCTCCCCCCATGGCCAC 
CGCCCCCTCATTATCCCGCCGGGCTCCCTGGCTCTCCCGGGCCGGGGTCTCCTCCGCCCC 
CCGGCGGCCTAGAGCTGCAGTCGCCGCCACCGCTACTGCCCCAGATCCCGGCCCCGGGTT 
CCGGGGTCTCCTTTCACATCCAGATCGGGCTGACCCGCGAGTTCGTGCTGTTGCCCGCCG 
CCTCCGAGCTGGCTCATGTGAAGCAGCTGGCCTGTTCCATCGTGGACCAGAAGTTCCCTG 
AGTGTGGCTTCTACGGCCTTTACGACAAGATCCTGCTTTTCAAACATGACCCCACGTCGG 
CCAACCTCCTGCAGCTGGTGCGCTCGTCCGGAGACATCCAGGAGGGCGACCTGGTGGAGG 
TGGTGCTGTCGGCCTCGGCCACCTTCGAGGACTTCCAGATCCGCCCGCACGCCCTCACGG 
TGCACTCCTATCGGGCGCCTGCCTTCTGTGATCACTGCGGGGAGATGCTCTTCGGCCTAG 
TGCGCCAGGGCCTCAAGTGCGATGGCTGCGGGCTGAACTACCACAAGCGCTGTGCCTTCA 
GCATCCCCAACAACTGTAGTGGGGCCCGCAAACGGCGCCTGTCATCCACGTCTCTGGCCA 
GTGGCCACTCGGTGCGCCTCGGCACCTCCGAGTCCCTGCCCTGCACGGCTG7VAGAGCTGA 
GGCGTAGCACCACCGAACTCCTGCCTCGCCGTCCCCCGTCATCCTCTTCCTCCTCTTCTG 
CCTCATCGTATACGGGCCGCCCCATTGAGCTGGACAAGATGCTGCTCTCCAAGGTCAAGG 
TGCCGCACACCTTCCTCATCCACAGCTATACACGGCCCACCGTTTGCCAGGCTTGCAAGA 
AACTCCTCAAGGGCCTCTTCCGGCAGGGCCTGCAATGCAAAGACTGCAAGTTTAACTGTC 
ACAAACGCTGCGCCACCCGCGTCCCTAATGACTGCCTGGGGGAGGCCCTTATCAATGGAG

ATGTGCCGATGGAGGAGGCCACCGATTTCAGCGAGGCTGACAAGAGCGCCCTCATGGATG 

AGTCAGAGGACTCCGGTGTCATCCCTGGCTCCCACTCAGAGAATGCGCTCCACGCCAGTG 

AGGAGGAGGAAGGCGAGGGAGGCAAGGCCCAGAGCTCCCTGGGGTACATCCCCCTAATGA 

GGGTGGTGCAATCGGTGCGACACACGACGCGGAAATCCAGCACCACGCTGCGGGAGGGTT 

GGGTGGTTCATTACAGCT^CAAGGACACGCTGAGAAAGCGGCACTATTGGCGCCTGGACT 

GCAAGTGTATCACGCTCTTCCAGAACAACACGACCAACAGATACTATAAGGAAATTCCGC 

TGTCAGAAATCCTCACGGTGGAGTCCGCCCAGAACTTCAGCCTTGTGCCGCCGGGCACCA 

ACCCACACTGCTTTGAGATCGTCACTGCCAATGCCACCTACTTCGTGGGCGAGATGCCTG 

GCGGGACTCCGGGTGGGCCAAGTGGGCAGGGGGCTGAGGCCGCCCGGGGGCTGGNNGAGA 

CAGCCATCCGCCAGGCCCTGATGCCCGTCATCCTTCAGGACGCACCCAGCGCCCCAGGCC 

ACGCGCCCCACAGACAAGCTTCTCTGAGGATCTCTGTGTCCAACAGTCAGATCCAAGAGA 

ATGTGGACATTGCCACTGTCTACCAGATCTTCCCTGACGAAGTGCTGGGCTCAGGGCAGT 

TTGGAGTGGTCTATGGAGGAAAACACCGGAAGACAGGCCGGGACGTGGCAGTTAAGGTCA 

TTGACAAACTGCGCTTCCCTACCAAGCAGGAGAGCCAGCTCCGGAATGAAGTGGCCAT^ 

TGCAGAGCCTGCGGCATCCCGGGATCGTGT^CCTGGAGTGCATGTTCGAGACGCCTGAGA 

AAGTGTTTGTGGTGATGGAGAAGCTGCATGGGGACATGTTGGAGATGATCCTGTCCAGTG 

AGAAGGGCCGGCTGCCTGAGCGCCTCACCAAGTTCCTCATCACCCAGATCCTGGTGGCTT 

TGAGACACCTTCACTTCAAGAACATTGTCCACTGTGACTTGAAACCAGAAAACGTGTTGC 

TGGCATCAGCAGACCCATTTCCTCAGGTGAAGCTGTGTGACTTTGGCTTTGCTCGCATCA 

TCGGCGAGAAGTCGTTCCGCCGCTCAGTGGTGGGCACGCCGGCCTACCTGGCACCCGAGG 

TGCTGCTCAACCAGGGCTACAACCGCTCGCTGGACATGTGGTCAGTGGGCGTGATCATGT 

ACGTCAGCCTCAGCGGCACCTTCCCTTTCAACGAGGATGAGGACATCAATGACCAGATCC 

AGAACGCCGCCTTCATGTACCCGGCCAGCCCCTGGAGCCACATCTCAGCTGGAGCCATTG 

ACCTCATCAAO^ACCTGCTGCAGGTGAAGATGCGCAAACGCTACAGCGTGGACT^ 

TCAGCCACCCCTGGTTACAGGAGTACCAGACGTGGCTGGACCTCCGAGAGCTGGAGGGGA 

AGATGGGAGAGCGATACATCACGCATGAGAGTGACGACGCGCGCTGGGAGCAGTTTGCAG 

CAGAGCATCCGCTGCCTGGGTCTGGGCTGCCCACGGACAGGGATCTCGGTGGGGCCTGTC 

CACCACAGGACCACGACATGCAGGGGCTGGCGGAGCGCATCAGTGTTCTCTGAGGTCCTG 

TGCCCTCGTCCAGCTGCTGCCCTCCACAGCGGTTCTTCACAGGATCCCAGCAATGAACTG 

TTCTAGGGAAAGTGGCTTCCTGCCCAAACTGGATGGGACACGTGGGGAGTGGGGTGGGGG 

GAGCTATTTCCAAGGCCCCTCCCTGTTTCCCCAGCAATTAAAACGGACTCATCTCTGGCC 

CCATGGCCTTGATCTCAGCAAAA 

SEQ ID NO: 10_AA127299_H 

ATTCAATTCATAATTGTTGGTGCy^AAAGATTTGCTTGCTATGGATTCAT^TGGT 

GATCCTTACATCAAAATCACAAATCTTTCTCAAAAAACGAAAGTGATTAAGAAAACTTTG 

ACTCCAACTTGGAATGAAACTTTTTTTGTGCATTTTCCAGAAAAAACAACCCTTGAATTA 

GAATGTTGGGACCACGATACTTTTTCAGATGATTTTATTGGCAAGGCTTCCATTTCTTTG 

GCAGAGATTCCAGCTTTGGCAGAAGTTGATATGTGGATAGATATGAAAACGAAAAAAGGA 

GAATTTGCAGGAAAA 

SEQ ID NO: 11_AA316804_H 

ATGTCTGCAAATAATTCCCCTCCATCAGCCCAGAAGTCTGTATTACCCACAGCTATTCCT 
GCTGTGCTTCCAGCTGCTTCTCCGTGTTCAAGTCCTAAGACGGGACTCTCTGCCCGACTC 
TCTAATGGAAGCTTCAGTGCACCATCACTCACCAACTCCAGAGGCTCAGTGCATACAGTT 
TCATTTCTACTGCAAATTGGCCTCACACGGGAGAGTGTTACCATTGAAGCCCAGGAACTG 
TCTTTATCTGCTGTCAAGGATCTTGTGTGCTCCATAGTTTATCAAAAGTTTCCAGAGTGT 
GGATTCTTTGGCATGTATGACAAAATTCTTCTCTTTCGCCATGACATGAACTCAGAAAAC 
ATTTTGCAGCTGATTACCTCAGCAGATGAAATACATGAAGGAGACCTAGTGGAAGTGGTT 
CTTTCAGCTTTAGCCACTIGTAGAAGACTTCCAGATTCGTCCACATACTCTCTATGTACAT

TCTTACAAAGCTCCTACTTTCTGTGATTACTGTGGTGAGATGCTGTGGGGATTGGTACGT 

CAAGGACTGAAATGTGAAGGCTGTGGATTAAATTACCATAAACGATGTGCCTTCAAGATT 

CCAAATAACTGTAGTGGAGTAAGAAAGAGACGTCTGTCAAATGTATCTTTACCAGGACCC 

GGCCTCTCAGTTCCAAGACCCCTACAGCCTGAATATGTAGCCCTTCCCAGTGAAGAGTCA 

CATGTCCACCAGGAACCAAGTAAGAGAATTCCTTCTTGGAGTGGTCGCCCAATCTGGATG 

GAAAAGATGGTAATGTGCAGAGTGAAAGTTCCACACACATTTGCTGTTCACTCTTACACC 

CGTCCCACGATATGTCAGTACTGCAAGCGGTTACTGAAAGGCCTCTTTCGCCAAGGAATG 

CAGTGTAAAGATTGCAAATTCAACTGCCATAAACGCTGTGCATCAAAAGTACC;^ 

TGCCTTGGAGAGGTTACTTTCAATGGAGAACCTTCCAGTCTGGGAACAGATACAGATATA 

CCAATGGATATTGACAATAATGACATAAATAGTGATAGTAGTCGGGGTTTGGATGACACA 

GAAGAGCCATCACCCCCAGAAGATAAGATGTTCTTCTTGGATCCATCTGATCTCGATGTG 

GAAAGAGATGAAGAAGCCGTTAAAACAATCAGTCCATCAACAAGCAATAATATTCCGCTA 

ATGAGGGTTGTACAATCCATCAAGCACACAAAGAGGAAGAGCAGCAC^ 

GGGTGGATGGTCCATTACACCAGCAGGGATAACCTGAGAAAGAGGCATTATTGGAGACTT 

GACAGCAAATGTCTAACATTATTTCAGAATGAATCTGGATCAAAGTATTATAAGGAAATT 

CCACTTTCAGAAATTCTCCGCATATCTTCACCACGAGATTTCACAAACATTTCACAAG 

AGCAATCCACACTGTTTTGAAATCATTACTGATACTATGGTATACTTCGTTGGTGAGAAC 

AATGGGGACAGCTCTCATAATCCTGTTCTTGCTGCCACTGGAGTTGGACTTGATGTAGCA 

CAGAGCTGGGAAAAAGCAATTCGCCAAGCCCTCATGCCTGTTACTCCTCAAGCAAGTGTT 

TGCACTTCTCCAGGGCAAGGGAAAGATCACAAAGATTTGTCTACAAGTATCTCTGTATCT 

AATTGTCAGATTCAGGAGAATGTGGATATCAGTACTGTTTACCAGATCTTTGCAGATGAG 

GTGCTTGGTTCAGGCCAGTTTGGCATCGTTTATGGAGGAAAACATAGAAAGACTGGGAGG 

GATGTGGCTATTAAAGTAATTGATAAGATGAGATTCCCCACAAAACAAGAAAGTCAACTC 

CGTAATGAAGTGGCTATTTTACAGAATTTGCACCATCCTGGGATTGTAAACCTGGAATGT 

atgtttgaaaccccagaacgagtctttgtagtaatggaaaagctgcatggagatatgttg 
gaaatgattctatccagtgagaaaagtcggcttccagaacgaattactaaattcatggtc 
acacagatacttgttgctttgaggaatctgcattttaagaatattgtgcactgtgattta 
aagccagaaaatgtgctgcttgcatcagcagagccatttcctcaggtgaagctgtgtgac 
tttggatttgcacgcatcattggtgaaaagtcattcaggagatctgtggtaggaactcca 
gcatacttagcccctgaagttctccggagcaaaggttacaaccgttccctagatatgtgg 
tcagtgggagttatcatctatgtgagcctcagtggcacatttccttttaatgaggatg;^ 
gatataaatgaccaaatccaaaatgctgcatttatgtacccaccaaatccatggagagaa 

ATTTCTGGTGAAGC^TTGATCTGATAAACAATCTGCTTCAAGTGAAGATGAGAAAACGT 
TACAGTGTTGACAAATCTCTTAGTCATCCCTGGCTACAGGACTATCAGACTTGGCTTGAC 
CTTAGAGAATTTGAAACTCGCATTGGAGAACGTTACATTACTVCATGA^ 

cgctgggaaatacatgcj^tacacacataaccttgtatacccaaagcacttcattatggct 
cctaatccagatgatatggaagaagatccttaa 

SEQ ID NO: 12_PKNBETA_H

atggaggagggggcgccgcggct^gcctgggccgagccagtggcccccagaggatgagaag 
gaggtgatccgccgggccatccagaaagagctgaagatcaaggagggggtggagaacctg 
cggcgcgtggccacagaccgccgccacttgggccatgtgcagcagctgctgcggtcctcc 
aaccgccgcctggagcagctgcatggcgagctgcgggagctgcacgcccgaatcctgctg 
cccggccctgggcctggcccagctgagcctgtggcctcaggaccccggccgtgggcagag 
cagctcagggctcggcacctagaggctctccggaggcagctgcatgtggagctgaaggtg 
aaacagggggctgagaacatgacccacacgtgcgccagtggcacccccaaggagaggaag 
ctccttgcagctgcccagcagatgctgcgggacagccagctgaaggtggccctgctgcgg 
atgaagatcagcagcctggaggccagtgggtccccggagccagggcctgagctactggcg 
gaggagctacagcatcgactgcacgttgaggcagcggtggctgagggcgccaagaacgtg 
gtgaaactgcttagtagccggagaacacaggaccgcaaggc7vctggctgaggcccaggcc

cagctacaggagtcctctcagaaactggacctcctgcgcctggccttggagcagctgctg 

gagcaactgcctcctgcccaccctttgcgcagcagagtgacccgagagttgcgggctgcg 

gtgcctggatacccccagccttcagggacacctgtgj^gcccaccgccctaacagggaca 

ctgcaggtccgcctcctgggctgtgaacagttgctgacagccgtgcctgggcgctcccca 

gcggccgcactggccagcagcccctccgagggctggcttcggaco^ggccaagcacc^^ 

cgtggccgaggcgagcttgccagtgaggtgctggctgtgctaaaggtggacaaccgtgtt 

gtggggcagacgggctgggggcaggtggccgaacagtcctgggaccagacctttgtcatc 

ccactggagcgagcccgtgagctggagattggggtacactggcgggactggcggcagcta 

tgtggcgtggccttcctgagacttgaagacttcctggacaatgcctgtcaccaactgtcc 

ctcagcctggtaccgcagggactgctttttgcccaggtgaccttctgcgatcctgtcatt 

gagaggcggccccggctgcagaggcaggaacgcatcttctctaaacgcagaggccaggac 

ttcctgaggcgttcgcagatgaacctcggcatggcggcctgggggcgcctcgtcatgaac 

ctgctgcccccctgovgctccccgagcacaatcagcccccctaaaggatgccctcggacc 

ccaacaactvctgcgagaggcctctgaccctgccactcccagtaatttcctgcccaag;^ 

acccccttgggtgaagagatgacacccccacccaagcccccacgcctctacctcccccag 

gagccaacatccgaggagactccgcgcaccaaacgtccccatatggagcctaggactcga 

cgtgggccatctccaccagcctcccccaccaggaaaccccctcggcttcaggacttccgc 

tgcttagctgtgctgggccggggacactttgggaaggtcctcctggtccagttcaagggg 

acagggaaatactacgccatcaaagcactgaagaagcaggaggtgctcagccgggacgag 

atagagagcctgtactgcgagaagcggatcctggaggctgtgggctgcacagggcaccct 

TTCCTGCTCTCCCTCCTTGTCTGCTTCCAGACCTCCAGCCATGCCCGCTTTGTGACTGAG 

tttgtgcctggtggtgacctcatgatgcagatccacgaggatgtcttccccgagccccag 

GCCCGCTTCTACGTGGCTTGTGTTGTCCTGGGGCTGCAGTTCTTACACGAGAAGAAGATC 

atttacagggacctgaagttggataaccttctgctggatgcccagggattcctgaagatc 
gcagactttggactctgct^ggaagggatcggcttcggggaccggactagcaccttctgt 
ggcaccccggagttcctggctcccgaggtgctgacccaggaggcatacacacaggccgtc 
gactggtgggcgctgggtgtgctgctctacgagatgctggtgggtgagtgcccgttccca 
ggggacacagaggaagaggtgtttgactgcatcgtcaacatggacgccccctaccccggc 
tttctgtcggtgcaagggcttgagrtcattcagaagctcctccagaagtgcccggagaag 
cgcctcggggcaggtgagcaggatgccgaggagatcaaggtccagccattcttcaggacc 
accaactggcaagccctgctcgcccgovccatcctigcccccottcgtgcctaccctgtgt 
ggccctgcggacctgcgctactttgagggcgagttcacagggctgccgcctgccctgacc 
ccacctgcaccccacagcctcctcactgcccgccaacaggccgccttccgggacttcgac 
tttgtgtcagagcgattcctggaaccctga 

SEQ ID NO: 13_AI021023_M PKNBETA_M

gctgaagtgggataaccttctgctggatgcccagggattcctgaagatcgcagactttgg 

actctgcaaggaagggatcggcttcggggaccggactagcaccttctgtggcaccccgga 

gttcctggctcccgaggtgctgacccaggaggcatacacacgggctgtggactggtgggg 

gctgggtgtgctgctctacgagatgctggtgggtgagtgcccgttcccaggggacacaga 

ggaagaggtgtttgactgcatcgtcaacatggacgccccctaccccggctttctgtcggt 

gcaagggcttgagttcattcagaagctcctccagaagtgcccggagaagcgcctcggggc 

gggtgagcaggatgccgacxsagatcaaggtccagccattcttcaggaccacc^ 

agccctgctcgcccgcaccatccagcccccctttgtgcctaccctgtgtggccctgcgga 

cctgcgctactttgagggcgagttcacagggctgccgcctgccctgaccccacctgcacc 

ccacagcctcctcactgcccgccaacaggccgccttccgggacttcgactttgtgtcaga 

gcgattcctggaaccctgagggcatctcctggcacctctgtccccttcccccacagactg 

ttagagcctctgctcgttcacccgtgcgccctgcctggaggtccaggccttgctgggtac 

ttctgagcccttgggattcaaagtggcagccatggggccactgttgtgggctttgctcag 
TGTCACTGGGCAAAGTGTGTCCCTTCCCCCTCCAGCTCGCCCTCTTCTACCTCCCAGCGA
GACCTGGCCCAGAAAGGGTGCCGCAGCAAGGAGTGATATGGTTTGTCTTTTTAAGACTGG 
ACTTGCTTTATATTAAATTTGTAAAAGTG 

SEQ ID NO: 14_H19102_H 

GGTGGCAACATCCGGGGTCCCTGGGCCCGAGGCTGGAAGAGCCTCTGGACAGGTTTGGGA 

ACCJ^TCAGGTCAGATCTGGAAGAACTCTGGGAACTACGGGGGCACCACTATCTGCACCAG 

GAATCCCTAAAGCCAGCCCCAGTACTGGTAGAGAAGCCTCTGCCAGAGTGGCCAGTGCCT 

CAGTTCATCAACCTCTTTCTACCAGAGTTTCCCATTAGGCCCATTAGGGGGCAGCAGCAG 

CTGAAGATTTTAGGCCTCGTGGCTAAAGGCTCCTTTGGAACTGTCCTCAAGGTGCTAGAT 

TGCACCCAGAAAGCTGTATTTGCAGTGAAGGTGGTGCCCAAGGTAAAGGTCCTACAGAGG 

GATACCGTGAGGCAGTGCAAAGAGGAGGTTAGCATCCAGCGACAGATC;^ 

GTACACAGCTTGGGGGACAGCTGGCT^GGGAAAACGGCACCTTTTCATTATGTGTAGCTAC 

TGCAGCACAGATCTGTACTCCCTTTGGTCGGCTGTTGGCTGCTTTCCTGAGGCTTCCATC 

CGTCTCTTTGCTGCCGAGTTGGTGCTGGTACTGTGTTATCTCCATGACTTGGGCATCATG 

CATCGAGATGTGAAGATGGAGAATATTCTTCTAGATGAACGAGGCCATCTGAAACTGACA 

GACTTTGGTCTGTCCCGCCACGTGCCCCAGGGAGCTCAAGCCTACACTATCTGTGGCACT 

CTTCAGTACy^TGGCCCCAGAGGTCCTAAGTGGAGGACCTTACAACCATGCTGCTGATTGG 

TGGTCCCTGGGTGTCTTGCTTTTCTCTCTGGCGACTGGAAAGTTTCCAGTGGCTGCAGAG 

AGAGATCATGTGGCCATGTTGGCAAGTGTGACCCACAGTGACTCTGAGATCCCAGCTTCT 

CTTAACCAGGGCCTCTCACTCCTGCTCCATGAGCTCTTATGCCAGAACCCCCTCCATCGT 

CTACGTTATCTGCATCACTTCCAGGTCCACCCTTTCTTTCGGGGTGTGGCCTTCGACCCA 

GAGCTCCTACAGAAGCAGCCAGTGAACTTTGTCACGGAGACTVCAAGCTACCCAGCCCAGT 

TCAGCGGAGACCATGCCCTTTGACGACTTTGACTGTGATCTGGAGTCCTTCTTGCTCTAC 

CCTATCCCTGCTTGA 

SEQ ID NO: 15_AA476563_H

ATGGAATTCTTTAGGATAGACAGTAAGGATAGCGCAAGTGAACTCCTGGGACTTGACTTT 

GGAGAAAAATTGTATAGTCTAAAATCAGAACCTTTGAAACCATTCTTTACTCTTCCAGAT 

GGAGACAGTGCTTCTAGGAGTTTTAATACTAGTGT^AAGCAAGGTAGAGTTTAAAGCTCAG 

GACTVCCATTAGCAGGGGCTCAGATGACTCAGTGCCAGTTATTTCGTTTAAAGATGCTGCT 

TTTGATGATGTCAGTGGTACTGATGAAGGAAGACCTGATCTTCTTGTAAATTTACCTGGT 

GAATTGGAGTCAACAAGAGAAGCTGCAGCAATGGGACCTACTAAGTTTACAC^^ 

ATAGGGATAATAGT^AAATAAACTCTTGGAAGCCCCTGATGTTTTATGCCTCAGGCTTAGT 

ACTGAACAATGCCAAGCACATGAGGAGAAAGGCy^TAGAGGAACTGAGTGATCCCTCTGGG 

CCCAAATCCTATAGTATAACAGAGAAACACTATGCACAGGAGGATCCCAGGATGTTATTT 

GTAGCAGCTGTTGATCATAGTAGTTCAGGAGATATGTCTTTGTTACCCAGCTCAGATCCT 

AAGTTTCAAGGACTTGGAGTGGTTGAGTCAGCAGTAACTGCAAACAACACAGAAGAAAGC 

TTATTCCGTATTTGTAGTCCACTCTCAGGTGCTAATGAATATATTGCAAGCACAGACACT 

TTAAAAACAGAAGAAGTATTGCTGTTTACAGATCAGACTGATGATTTGGCTAAAGAGGAA 

CCAACTTCTTTATTCCAGAGAGACTCTGAGACTAAGGGTGAAAGTGGTTTAGTGCTAGAA 

GGAGACAAGGAAATACATCAGATTTTTGAGGACCTTGATAAAAAATTAGCACTAGCCTCC 

AGGTTTTACATCCCT^GAGGGCTGCATTCAAAGATGGGCAGCTGAAATGGTGGTAGCCCTT 

GATGCTTTACATAGAGAGGGAATTGTGTGCCGCGATTTGAACCCAAACAACATCTTATTG 

AATGATAGAGGACACATTCAGCTAACGTATTTTAGCAGGTGGAGTGAGGTTGAAGATTCC 

TGTGACAGCGATGCCATAGAGAGAATGTACTGTGCCCCAGAGGTTGGAGCAATCACTGAA 

GAAACTGAAGCCTGTGATTGGTGGAGTTTGGGTGCTGTCCTCTTTGAACTTCTCACTGGC 

AAGACTCTGGTTGAATGCCATCCAGCAGGAATAAATACTCACACTACTTTGAACATGCCA 

GAATGTGTCTCTGAAGAGGCTCGCTCACTCATTCAACAGCTCTTGCAGTTCAATCCTCTG 
GAACGACTTGGTGCTGGAGTTGCTGGTGTTGAAGATATCAAATCTCATCCATTTTTTACC 
CCTGTGGATTGGGCAGAACTGATGAGATGA 

SEQ ID NO: 16_AA626690_H 

ATGCTACCATTCGCTCCTCAGGACGAGCCCTGGGACCGAGAAATGGAAGTGTTCAGCGGC 

GGCGGCGCGAGCAGCGGCGAGGTAAATGGTCTTAAAATGGTTGATGAGCCAATGGAAGAG 

GGAGAAGCAGATTCTTGTCATGATGAAGGAGTTGTTAAAGAAATCCCTATTACTCATCAT 

GTTAAGGAAGGCTATGAGAAAGCAGATCCTGCACAGTTTGAGTTGCTCAAGGTTCTTGGT 

CAGGGGTCATTTGGAAAGGTTTTTCTTGTTAGAAAGAAGACCGGTCCTGATGCTGGGCAG 

CTCTATGCAATGAAGGTGTTAAAAAAAGCCTCTTTAAAAGTTCGAGACAGAGTTCGGACA 

AAGATGGAGAGGGATATACTGGTGGAAGTAAATCATCCATTTATTGTCAAATTGCACTAT 

GCCTTTCAGACTGAAGGGAAACTGTACTTAATACTGGATTTTCTCAGGGGAGGAGATGTT 

TTCACAAGATTATCCAAAGAGGTTCTGTTTACAGAGGAAGATGTGAAATTCTACCTCGCA 

GAACTGGCCCTTGCTTTGGATCATCTGCACCAATTAGGAATTGTTTATAGAGACCTGAAG 

CCAGAAAACATTTTGCTTGATGAAATAGGACATATCAAATTAACAGATTTTGGACTCAGC 

AAGGAGTCAGTAGATC7UVGAAAAGAAGGCTTACTCATTTTGTGGTACAGTAGAGTATATG 

GCTCCTGAAGTAGTAAATAGGAGAGGCCATTCCCAGAGTGCTGATTGGTGGTCATATGGT 

GTTCTTATGTTTGAAATGCTTACTGGTACTCTGCCATTTCAAGGTAAAGACAGAAATGAG 

ACCATGAATATGATATTAAAAGCT^AAACTTGGAATGCCTCT^TTTCT^ 

CAAAGTCTTCTAAGGATGTTATTCAAAAGGAATCCAGCAAATAGATTGGGATCAGAAGGA 

GTTGAAGAAATCAAAAGACATCTGTTTTTTGCAAATATTGACTGGGATAAATTATA 

AGAGAAGTTCT^CCTCCTTTCAAACCTGCTTCTGGAAAACCTVGATGATACTTTTO 

GATCCTGAATTTACTGCAAAAACACCTAAAGATTCTCCCGGTTTGCCAGCCAGTGCAAAT 

GCTCATCAGCTCTTCAAAGGATTCAGCTTTGTTGCAACTTCTATTGCAGAAGAATATA^ 

ATCACTCCTATCACAAGTGCAAATGTATTACCAATTGTTCAGATAAAT 

CAATTTGGTGAAGTATATGAATTGAAGGAGGATATTGGTGTTGGCTCCTACTCTGTTTGC 

AAGCGATGCATACATGCAACTACCAACATGGAATTTG(^GTGAAGATCATTGACAA^ 

AAGCGAGACCCTTCAGAAGAGATTGAAATATTGATGCGCTATGGACAACATCCCAACATT 

ATTACTTTGAAGGATGTCTTTGATGATGGTAGATATGTTTACCTTGTTACGGATTTAATG 

AAAGGAGGAGAGTTACTTGACCGTATTCTCAAACAAAAATGTTTCTCGGAACGGGAGGCT 

AGTGATATACTATATGTAATAAGTAAGACAGTTGACTATCTTCATTGTCAAGGAGTTGTT 

CATCGTGATCTTAAACCTAGTAATATTTTATACATGGATGAATCAGCCAGTGCAGATTCA 

ATCAGGATATGTGATTTTGGGTTTGCAAAACAACTTCGAGGAGAAAATGGACTTCTCTTA 

ACTCCATGCTACACTGCAAACTTTGTTGCACCTGAGGTTCTTATGCAACAGGGATATGAT 

GCTGCTTGTGATATCTGGAGTTTAGGAGTCCTTTTTTACACAATGTTGGCTGGCTACACT 

CCATTTGCTAATGGCCCCAATGATACTCCTGAAGAGATACTGCTGCGTATAGGCAATGGA 

AAATTCTCTTTGAGTGGTGGAAACTGGGACAATATTTCAGACGGAGCAAAGGATTTGCTT 

TCCCATATGCTTCATATGGACCCACATCAGCGGTATACTGCTGAACAAATATTAAAGCAC 

TCATGGATAACTCACAGAGACCAGTTGCCAAATGATCAGCCAAAGAGAAATGATGTGTCA 

CT^TGTTGTTAAGGGAGCAATGGTTGCAACATACTCTGCCCTGACTCACAAGACCTTTCi^ 

CCAGTCCTAGAGCCTGTAGCTGCTTCAAGCTTAGCCCAGCGACGGAGCATGAAAAAGCGA 

ACATCAACTGGCCTGTAA 

SEQ ID NO: 17_AA215680_H 

ATGAGCCTGGTGGCCTGTGAGTGCCTGCCCAGCCCCGGCCTGGAGCCTGAGCCTTGCTCA 
CGAGCACGGTCCCAAGCTCACGTGTACCTGGAGCAGATTCGCAACAGGGTGGCTCTGGGA 
GTGCCTGACATGACAAAACGTGACTATCTGGTGGATGCGGCCACGCAGATCCGGCTGGCC 
CTGGAGCGCGATGTTAGTGAGGACTATGAGGCGGCCTTCAACCACTATCAGAATGGCGTG 
GACGTGCTGCTCCGTGGCATACACGTTGACCCCAACAAGGAGCGACGTGAGGCTGTGAAG 
CTGAAAATTACCAAATACCTGCGGCGGGCAGAGGAGATCTTCAACTGCCACCTGCAGCGG 
CCGCTGAGCAGTGGAGCCAGCCCCAGCGCGGGTTTCAGCAGCCTGAGGCTCCGGCCCATT 
CGCACGCTGAGCTCTGCCGTGGAGCAGCTGAGGGGCTGCAGGGTGGTCGGGGTCATCGAG 
AAGGTGCAGCTGGTCCAGGACCCGGCAACCGGAGGGACCTTTGTGGTGAAGAGCCTACCC 
AGGTGCCACAtGGTGAGCAGGGAGCGGCTGACCATCATCCCACACGGAGTCCCCTACATG 
ACGAAGCTGCTCAGGTACTTTGTGAGCGAGGACTCCATCTTCCTGCACCTGGAGCATGTG 
CAAGGAGGCACTCTCTGGTCCCACCTGCTCTCCCAGGCGCACTCCCGACATTCTGGGCTC 
AGCTCTGGCTCTACCCAGGAGAGGATGAAGGCTCAGCTCAACCCCCACCTCAACCTCCTG 
ACCCCAGCGAGGCTTCCCTCAGGCCATGCCCCTGGCCAGGACAGAATCGCCCTGGAGCCT 
CCTAGGACTTCrCCGAACCTTCTCCTAGCTGGGGAGGCCCCATCCACCAGACCCCAGAGG 
GAGGCTGAAGGTGAACCCACAGCCAGGACCAGCACCTCTGGCTCCTCGGACCTTCCAAAG 
GCCCCAGGTGGCCACCTGCACCTTCAAGCTAGGAGGGCTGGCCAGAACTCAGACGCTGGG 
CCCCCTCGGGGGCTCACTTGGGTTCCTGAGGGGGCCGGCCCGGTGCTAGGGGGCTGTGGC 
CGAGGCATGGATCAGAGCTGCCTGTCAGCAGATGGGGCCGGCCGGGGCTGTGGCAGGGCC 
ACCTGGAGTGTGAGAGAGGAGCAGGTGAAGCAGTGGGCGGCAGAGATGCTGGTAGCGCTG 
GAGGCGCTGCACGAGCAGGGGGTGCTGTGCCGGGACCTCCACCCCGGGAACCTGCTCCTG 
GACCAGGCAGGTCACATCCGGCTCACATATTTTGGCCAGTGGTCAGAGGTGGAGCCCCAG 
TGCTGCGGGGAGGCCGTGGACAATCTCTACAGCGCCCCAGAGGTGGGTGGGATTTCCGAG 
CTGACGGAAGCCTGTGACTGGTGGAGCTTTGGGTCTCTACTGTATGAACTGCTGACGGGA 
ATGGCACTGTCCCAGAGCCACCCTTCAGGAATCCAGGCCCACACCCAGCTCCAGCTGCCC 
GAGTGGCTCAGTCGCCCAGCGGCCTCTCTGCTGACTGAGCTGCTGCAGTTCGAGCCTACC 
CGGGGCCTGGGCATGGGAGAAGGTGGTGTCAGCAAACTCAAGTCCCATCCCTTTTTCAGT 
ACCATCCAATGGAGCAAGCTGGTGGGGTAA 

SEQ ID NO: 18_SGK_H 

ATGACGGTGAAAACTGAGGCTGCTAAGGGCACCCTCACTTACTCCAGGATGAGGGGCATG 
GTGGCAATTCTCATCGCTTTCATGAAGCAGAGGAGGATGGGTCTGAACGACTTTATTCAG 
AAGATTGCCAATAACTCCTATGCATGCAAACACCCTGAAGTTCAGTCCATCTTGAAGATC 
TCCCAACCTCAGGAGCCTGAGCTTATGAATGCCAACCCTTCTCCTCCACCAAGTCCTTC^ 
CAGCAAATO^CCTTGGCCCGTCGTCCAATCCTCATGCTAAACCATCTGACTTTCACT^ 
TTGAAAGTGATCGGAAAGGGCAGTTTTGGAAAGGTTCTTCTAGCAAGACACAAGGCAGAA 
GAAGTGTTCTATGCAGTCAAAGTTTTACAGAAGAAAGCAATCCTGAAAAAGAAAGAGGAG 
AAGCJ^TATTATGTCGGAGCGGAATGTTCTGTTGAAGAATGTGAAGCACCCTTTCCTGGTG 
GGCCTTCACTTCTCTTTCCAGACTGCTGACAAATTGTACTTTGTCCTAGACTACATTAAT 
GGTGGAGAGTTGTTCTACCATCTCCAGAGGGAACGCTGCTTCCTGGAACCACGGGCTCGT 
TTCTATGCTGCn^SAAATAGCCAGTGCCTTGGGCTACCTGCy^TTCACTGAAC^ 
AGAGACTTAAAACCAGAGAATATTTTGCTAGATTCACAGGGACACATTGTCCTTACTGAT 
TTCGGACTCTGCAAGGAGAACATTGAAGACAACAGCACAACATCCACCTTCTGTGGCA 
CCGGAGTATCTCGCACCTGAGGTGCTTCATAAGCAGCCTTATGACAGGACTGTGGACTGG 
TGGTGCCTGGGAGCTGTCTTGTATGAGATGCTGTATGGCCTGCCGCCTTTTTATAGCCGA 
AACACAGCTGAAATGTACGACAACATTCTGAACAAGCCTCTCCAGCTGAAACCAAATATT 
ACAAATTCCGCAAGACACCTCCTGGAGGGCCTCCTGCAGAAGGACAGGACAAAGCGGCTC 
GGGGCCT^GGATGACTTCATGGAGATTAAGAGTCT^TGTCTTCTTCTCCTTAATTAACTGG 
GATGATCTCATTAATAAGAAGATTACTCCCCCTTTTAACCCAAATGTGAGTGGGCCCAAC 
GAGCTACGGCACTTTGACCCCGAGTTTACCGAAGAGCCTGTCCCCAACTCCATTGGCAAG 
TCCCCTGACAGCGTCCTCGTCACAGCCAGCGTCAAGGAAGCTGCCGAGGCTTTCCTAGGC 
TTTTCCTATGCGCCTCCCACGGACTCTTTCCTCTGA 

SEQ ID NO: 19_AA107515_M 

CGGGTCGACCCACGCGTCCGCCGGTTTCACTGCTCCCCTCAGTCTCTTTTGGGCTCTTTC 
CGGGCATCGGGACGATGACCGTCA7VAGCCGAGGCTGCTCGAAGCACCCTTACCTACTCCA 
FIGURE 2N 

GAATGAGGGGAATGGTAGCGATTCTCATCGCTTTTATGAAACAGAGAAGGATGGGCCTGA 

ACGATTTTATTCAGAAGATTGCCAGCAACACCTATGCATGCAAACACGCTGAAGTTCAGT 

CCATTTTGAAAATGTCCCATCCTCAGGAGCCGGAGCTTATGAACGCTAACCCCTCTCCTC 

CGCCAAGTCCCTCTCAACAAATC;^CCTGGGTCCGTCCTCCAACCCTCy\.CGCC^^ 

CCGACTTTCACTTCTTGAAAGTGATCGGAAAGGGCAGTTTTGGAAAGGTTCTTCTGGCTA 

GGCACAAGGCAGAAGAAGTATTCTATGCAGTCAAAGTTTTACAGAAGAAAGCCATCCTGA 

AGAAGAAAGAGGAGAAGCATATTATGTCAGAGCGGAATGTTCTGTTGAAGAATGTGAAGC 

ACCCTTTCCTGGTGGGCCTTCACTTCTCATTCCAGACCGCTGACAAGCTCTACTTTGTCC 

TGGACTACATTAATGGTGGAGAGCTGTTCTACCATCTCCAGAGGGAGCGCTGCTTCCTGG 

AACCACGGGCTCGATTCTACGCAGCTGAAATAGCCAGTGCCCTGGGCTATCTGCACTCCC 

TAAACATCGTTTATAGAGACTTAAAACCTGAGAATATTCTCCTAGACTCCCAGGGGCACA 

TCGTCCTCACTGACNTATTTCAGCTGCGTAGAATCGAGCATAACGGGACAACATCTACCT 

TCTGTGGCACGCCTGAGTATCTGGCTCCTGAGGTCCTCCATAAGCAGCCGTATGACCGGA 

CGGTGGACTGGTGGTGTCTTGGGGCTGTCCTGTATGAGATGCTCTACGGCCTGCCCCCGT 

TTTATAGCCGGAACACGGCTGAGATGTACGACAATATTCTGAACAAGCCTCTCCAGTTGA 

aaccaaatattacaaactcggcaaggcacctcctggaaggcctcctgcaga^ 

cc^gaggctgggtgccaaggatgactttatggagattaagagtcatattttcttctctt 

taattaactgggatgatctcato^taagaagattacacccccatttaaccca;^ 

gtgggcccagtgaccttcggcacttcgatcccgagtttaccgaggagccggtccccagct 

ccatcggcaggtcccctgacagcatccttgtcacggccagtgtgaaggaagcagcag 

CCTTCCTCGGCTTCTCCTATGCACCTCCTGTGGATTCCTTCCTCTGAGTGCTCCCGGGAT 

ggttctgaaggacttcctcagcgtttcctaaagtgttttcgttagcctttggtggagttg 
ccagctgacagaacattttaaaagaatttgcacacctggaagcttggcagtctcgcctgc 
ccggcgtggcgcgacgcagcgcgcgctgcttgatgggagctttccgaagagcacaccctc 

CTCTCAATGAGCTTGTGAGGTCTTCTTTTCTTCTCTTCCTTCCAACGTGGTGCTAGCTCC 

aggcgagcgagcgtgagagtgccgcctgagacagacaccttggtctcagttagaaggaag 
atgcaggtctaagaggaatccccgcaggtctgtctgagctgtgatcaagaatattctgc^ 
atgtgccttttctgagatcgtgttagctccaaagctttttcctatcgcagagtgttcagt 
ttgtgtttgtttgtttttgttttgttttgtttttcccttggcggatttcccgtgtgtgca 
gtggcgtgagtgtgctatgcctgatcacagacggttttgttgtgagcatcaatgtgacac 

TTGCAGGACACTACAATGTGGGACATTGTTTGTTTCTTCCACATTTGGAAGATAAATTTA 

tgtgtagactgttttgtaagatatagttaataactaaaacctattgaaacggtcttgcaa 
tgacgagct^ttcagatgcttaaggaaagcartgctgctacaaatatttctatttttagaa 

AGGGTTTTTATGGACCAATGCCCCAGTTGTCAGTCAAAGCCGTTGGTGTTTTCATTGT^ 

aaaatgtcacctataaaacgggcattatttatgttttttttccctttgttct^tattc^^ 
tgcattcctgattattgtatgtatcgtgtaaaggaagtctgtact^ttgggttataacact 

AGATATTTAAACTTACAGGCTTATTTGTAAACCATCATTTTAATGTACTGTAATTAACAT 

gggttataatatgtacaattcctcctccttaccacacaactttttttgtgtgcgataaa 

CAATTTTGGTTTGCAATAAAATCTTGAAAACT 

SEQ ID NO: 20_AA109508_M 

CCACCTGCAGCGGGAGCGCCGGTTCCTGGAGCCCCGGGCCAGGTTCTACGCTGCTGAGGT 
GGCCAGCGCCATTGGCTACCTGCACTCCCTCAACATCATTTACAGGGATCTGAAACCAGA 
GAACATTCTCTTGGACTGCCAGGGACACGTGGTGCTGACGGATTTTGGCCTCTGCAAGGA 
AGGTGTAGAGCCTGAAGACACCACATCCACATTCTGTGGTACCCCTGAGTACTTGGCACC 
TGAAGTGCTTCGGAAAGAGCCTTATGATCGAGCAGTGGACTGGTGGTGCTTGGGGGCAGT 
CCTCTACGAGATGCTCCATGGCCTGCCGCCCTTCTACAGCCAAGATGTATCCCAGATGTA 
TGAGAACATTCTGCACCAGCCGCTACAGATCCCCGGAGGCCGGACAGTGGCCGCCTGTGA 
CCTCCTGCAAAGCCTTCTCCACAAGGACCAGAGGCAGCGGCTGGGCTCCAAAGCAGACTT 
TCTTGAGATTAAGAACCATGTATTCTTCAGCCCCATAAACTGGGATGACCTGTACCACAA 
GAGGCTAACTCCACCCTTCAACCCAAATGTGACAGGACCTGCTGACTTGAAGCATTTTGA 

CCCAGAGTTCACCCAGGAAGCTGTGTCCAAGTCCATTGGCTGTACCCCTGACACTGTGGC 

CAGCAGCTCTGGGGCCTCAAGTGaVTTCCTGGGATTTTCTTATGCGCCAGAGGATGATGA 

CATCTTGGATTGCTAGAAGAGAAGGACCTGTGAAACTACTGAGGCCAGCTGGTATTAGTA 

AGGAATTACCTTCAGCTGCTAGGAAGAGCGACTCAAACTAACT^TGGCTTCAACGAGAAG 

CAGGTTTATTTTTTCCAGCACATAAAAGAAAAATAATGTTTCGGAGTCCAGGACTGGCA 

GACAGGTCATCAGATACTCAGAGGCTGTATCTCTGCCCTGCCAACCTTGACAAATGGCTT 

CCAATGTTAGGTTTGCTACAAGATGGTTAGTGGAGCTCTAGCTGCCTATTTTGTGTTTAG 

GGAAGGGAAAATGGAGGAAAGGGGAGAAGAGCAAAGGGCGCTTTTAAAGAGCTTTCCCAA 

AAGCTCCACCCAATGACTTCTGCTTCCATCTCACTAACCACCCACCCCTACCTGGAATGG 

AGGCTGGGAGATGTGGCTTATTTGCTGGGTACGTGACTATCCCTAATAACAAAGGGGTTC 

TGACACTAAGACATTAGGGGAGAATGTTGGGTAGGCAGCCAGCACTCTTTTACCAGAGGG 

CCTCCTGGTGTTTGGATTTTGATCTCAATGTGTAAAATGACAGAGATGTAACAAGCTCAT 

AGGGTATCAATATCTCTTATTGTTCT 

SEQ ID NO: 21_AA887783_H 

CGGATGCATTTNTTGGTGTGCTCTTGAGGGATTAAATGCAAAGAGATCACACCATGGACT 

ACAAGGAAAGCTGCCCAAGTGTAAGNATTCCCAGCTCCGATGAACACAGAGAGAAAAAGA 

AGAGGTTTACTGTTTATAAAGTTCTGGTTTCAGTGGGAAGAAGTGAATGGTTTGTCTTCA 

GGAGATATGCAGAGTTTGATAAACTTTATAACACTTTAAAAAAACAGTTTCCTGCTANGG 

CCCTGAAGATTCCTGCCAAGAGAATATTTGGTGATAATTTTGATCCAGATTTTATTAAAC 

AAAGACGAGCAGGACTAAACGAATTCATTCAGAACCTAGTTAGGTATCCAGAACTTTATA 

ACCATCCAGATGTCAGAGCATTCCTTCAAATGGACAGTCCAAAACACCAGTCAGATCCAT 

CTGAAGATGAGGATGAAAGAAGTTCTCAGAAGCTACACTCTACCTCACAGAACATCAACC 

TGGGACCGTCTGGAAATCCTCATGCCAAACCAACTGACTTTGATTTCTTAAAAGTTATTG 

GAAAAGGCAGCTTTGGCAAGGTTCTTCTTGCAA/^CGGAAACTGGATGGAAAATTTTATG 

CTGTCAAAGTGTTACAGAAAAAAATAGTTCTCAACAGAAAAGAGCAAAAACATATTATGG 

CTGAACGTAATGTGCTCTTGAAAAATGTGAAACATCCGTTTTTGGTTGGATTGCATTATT 

CCTTCCAAACAACTGAAAAGCTTTATTTTGTTCTGGATTTTGTTAATGGAGGGGAGGGAC 

ATGTTGTCTTAACAGATTTTGGGCTTTGTAAAGAAGGAATTGCTATTTCTGACACCACTA 

CCACATTTTGTGGGACACCAGAGTATCTTGCACCTGAAGTAATTAGAAAACAGCCCTATG 

ACAATACTGTAGATTGGTGGTGCCTTGGGGCTGTTCTGTATGAAATGCTGTATGGATTGC 

CTCCTTTTTATTGCCGAGATGTTGCTGAAATGTATGACAATATCCTTCACAAACCCCTAA 

GTTTGAGGCCAGGAGTGAGTCTTACAGCCTGGTCCATTCTGGAAGAACTCCTAGAAAAAG 

ACAGGCAAAATCGACTTGGTGCC^GGAAGACTTTCTTGAAATTCAGAATCATCCTTTTT 

TTGAATCACTCAGCTGGGCTGACCTTGTACAAAAGAAGATTCCACCACCATTTAATCCTA 

ATGTGGCTGGACCAGATGATATCAGAAACTTTGAC7VCAGCATTTACAGAAGAAACAGTTC 

CATATTCTGTGTGTGTATCTTCTGACTATTCTATAGTGAATGCCAGTGTATTGGAGGCAG 

ATGATGCATTCGTTGGTTTCTCTTATGCACCTCCTTCAGAAGACTTATTTTTGTGAGCAG 

TTTGCCATTCAGAAACCATTGAGCAAAATAAGTCTATAGATGGGACTGAAACTTCTATTT 

GTGTGAATATATTCAAATATGTATAACTAGTGCCTCATTTTTATATGTAATGATGAAAAC 

TATGAAAAAATGTATTTTCTTCTATGTGCAAGAAAAATAGGGCATTTCAAAGAGCTGTTT 

TGATTAAAATTTATATTCTTGTTTAATAAGCTTATTTTTAAACAATTTAAAAGCTATTAT 

TCTTAGCATTAACCTATTTTTAAAGAAACCTTTTTTGCTATTGACTGTTTTTTCCCTCTA 

AGTTTACACTAACATCTACCCy^GATAGACTGTTTTTTAACAGTCAATTTCAGTT^^ 

AACATATATTAATACCTTTGTAACTCTTTGCTATGGCTTTTGTTATCACACCAAAACTAT 

GCAATTGGTACATGGTTGTTTAAGAAGAAACCGTATTTTTCCATGATAAATCACTGTTTG 

AAATATTTGGTTCATGGTATGATCGAAATGTAAAAGCATAATTAACACATTGGCTGCTAG 

TTAACAATTGGAATAACTTTATTCTGCAGATCATTTAAGAAGTAACAGGCCGGGCGCGGT 

GGCTCACGCCTGTAATCCCAGCACTTTGGGAGGCTGAGGCGGGCAGATCACCTGAGGTCA 
FIGURE 2P 

GGAGTTGGAGACCAGCCTGACCAACATGGACAAACCCCGTCTCTACTAAAAATACAAAAT 
TGGCAGGGTGTGGTGGCACATGCCTATAATCCCAGCTACTTGGGAGGCTAAGGCAGGAGA 
ATCGCTTGAACCCGGGAGGCGGAGGTTGCAGTGAGCCGAGATCGCACCATTGCACTCCTG 
CCTGGGCAACAAGAGTGAAACTCCATCTCC 

SEQ ID NO: 22_R47805_H 

ATGGCGCACCAAACGGGCATCCACGCCACGGAAGAGCTGAAGGAATTCTTTGCCAAGGCA 
CGGGCTGGCTCTGTGCGGCTCATCAAGGTTGTGATTGAGGACGAGCAGCTCGTGCTGGGT 
GCCTCGCAGGAGCCAGTAGGCCGCTGGGATCAGGACTATGACAGGGCCGTGCTGCCACTG 
CTGGACGCCCAGCAGCCCTGCTACCTGCTCTACCGCCTCGACTCACAGAATGCTCAGGGC 
TTCGAATGGCTCTTCCTCGCCTGGTCGCCTGATAACTCCCCCGTGCGGCTGAAGATGCTG 
TACGCGGCCACGCGGGCCACAGTGAAAAAGGAGTTTGGAGGTGGCCACATCAAGGATGAG 
CTCTTCGGGACTGTGAAGGATGACCTCTCTTTTGCTGGGTACCAGAAACACCTGTCGTCC 
TGTGCGGCACCTGCCCCGCTGACCTCGGCTGAGAGAGAGCTCCAGCAGATCCGCATTAAC 
GAGGTGAAGACAGAGATCAGTGTGGAAAGCAAGCACCAGACCCTGCAGGGCCTCGCCTTC 
CCCCTGCAGCCTGAGGCCCAGCGGGCACTCCAGCAGCTCTAGCAGAAAATGGTCAACTAC 
ATCCAGATGAAGCTGGACCTAGAGCGGGAAACCATTGAGCTGGTGCACACAGAGCCCACG 
GATGTGGCCCAGCTGCCCTCCCGGGTGCCCCGAGATGCTGCCCGCTACCACTTCTTCCTC 
TACAAGCACACCCATGAGGGCGACCCCCTTGAGTCTGTAGTGTTCATCTACTCCATGCCG 
GGGTACAAGTGCAGCATCAAGGAGCGAATGCTCTACTCCAGCTGCAAGAGCCGCCTCCTC 
GACTCCGTGGAGCAGGACTTCCATCTGGAGATCGCCAAGAAAATTGAGATTGGCGATGGG 
GCAGAGCTGACGGCAGAGTTCCTCTACGACGAGGTGCACCCCAAGCAACACGCCTTCAAG 
CAGGCCTTCGCCAAGCCCAAGGGCCCAGGGGGCAAGCGGGGCCATAAGCGCCTCATCCGC 
GGCCCGGGTGAAAATGGGGATGACAGCTAG 

SEQ ID NO: 23_H60215_H 

CCACGCGTCCGGCGCCGCAGCCATGGAGGGAGGCGGCGGCGGCGGCGGCGGCGGCTCGGG 

TGGCTGCGCTGGGAGGCGGCGGTGAGAGGCTCGCACGCCTCCAGCCCGGCCCCGGCCCCC 

CGGGAGGGAGAGCCGAGCAGCCCCGGCTCTGGGCTACGGACTATGGGCGAATAGCTCTGA 

CCACCCGGCGAAGTGCACACACCCAGAAGCTATGTCCTTCGGC7VGTAAAAGTTTTACAGC 

ACAATATATGTGCTCTGCTCTCCTCCCGCAATCCTGCTCCAAGAGATCTTAAGCTGGAGG 

CACCAGGTCTGAATTCCAGACTCCTCCCCACCACCCACACTTCACCTCCAACTGGA^ 

GACCACAGACCCATTCAGGGAGGCTGGCGGACTCTTCATCCTGGACAGTCCCTTACTGTA 

TGTCAAAGCTGAGAATGAAGCGGAGAGCT^TCAGACAGAGGAGCTGGGGAAACGTCGGCCA 

GGGCCAAGGCTCTAGGAAGTGGGATTTCTGGAAATAATGCAAAGAGAGCTGGACCATTCA 

TCCTTGGTCCCCGTCTGGGCAACTCACCGGTGCCAAGCATAGTGCAGTGTTTGGCGAGGA 

AAGATGGCACGGATGACTTCTATCAGCTGAAGATCCTGACCCTGGAGGAGAGGGGGGACC 

AAGGCATAGAGAGCCAGGAAGAGCGGCAGGGCAAGATGCTGCTGCACACCGAGTACTCAC 

TGCTGTCTCTCCTGCACACGCAGGATGGCGTGGTGCACCACCACGGCCTCTTCCAGGACC 

GCACCTGTGAAATCGTTGAGGACACAGAATCCAGCCGGATGGTTAAGAAGATGAAGAAGC 

GCATCTGCCTCGTCCTGGACTGCCTCTGTGCTCATGACTTCAGCGATAAGACCGCTGACC 

TCATCAACCTGCAGCACTACGTCTITCAAGGAGAAGAGGCTCAGCGAGAGGGAGAC^ 

TAATCTTCTACGACGTGGTCCGCGTGGTGGAGGCCCTGCACCAGAAAAATATCGTGCACA 

GAGACCTGAAGCTGGGGAACATGGTGCTCAACAAGAGGACACATCGGATAACCATCACCA 

ACTTCTGCCTCGGGAAGCATCTGGTGAGCGAGGGGGACCTGCTGAAGGACCAGAGAGGGA 

GCCCTGCCTACATCAGTCCCGACGTGCTCAGCGGCCGGCCGTACCGTGGCAAGCCCAGTG 

ACATGTGGGCCCTGGGCGTGGTGCTCTrCACCATGCTGTATGGCCAGTTCCCCTTCTACG 

ACAGCATCCCGCAGGAGCTCTTCCGCAAGATCAAGGCTGCCGAGTATACCATTCCTGAGG 

ATGGACGGGTTTCTGAGAACACCGTGTGTCTCATCCGGAAGCTGCTGGTCCTTGACCCCC 

AGCAGCGCCTGGCCGCCGCCGACGTCCTGGAGGCCCTCAGTGCCATCATTGCATCATGGC 
AGTCCCTGTCATCTCTGAGTGGGCCTTTGCAAGTGGTTCCTGACATTGATGACCAAATGA 
GCT^TGCGGATAGCTCCCAGGAGGCGAAGGTGACGGAGGAGTGCTCCCAGTACGAGTTTG 
AGAACTACATGCGTCAGCAGCTGCTGCTGGCCGAGGAGAAGAGCTCCATCCATGACACCC 
GGAGCTGGGTACCCAAGCGGCAGTTCGGCAGCGCACCACCGGTGCGACGGCTGGGCCACG 
ACGCACAGCCCATGACCTCCTTGGACACGGCCATCCTGGCGCAGCGCTACCTGCGGAAAT 
AACAGCCTCAGCCGGGGCCACCAGCACTGCTGCCACTTCTTCCAGCCCCAGCCAAAGGCG 
TGGCTGTCAGGGCTGGGCCCTGTAGTGCTGGACTCTCCCGGGCCACAATAGGGACAGGGC 
AGGGACAGGGACAGCCCAGGTCACACGTGGGGTCT^GCAGAGGTACCACGAAGCTACCTTT 
TGGGATGATTGCTCGATTGTTTGGTTTTTAAATCTGAGAAGCCTAGATAACTAATCTGCT 
TTTAATCACGATGTTTTAATCTACCTCTGTCTCTTTAACCATGCTGTCTCTGGACTGAGC 
AAGAGGGAGGAGGGAGCCTGCTCACCCeACTCCAGGGCCTTCCCCAGCGGCCACCAACTG 
ACCTGGGGCGCTGCTCCCCACAGTCCAAATAAGCTG/^AAGTGCAGCTCGCTGCAGGCCCC 
AGAGCGAGCTTCCCCTCCTCCCTGCTCTCCCAGGCCCCTGCCACAGCCTCTTTCCGTCCC 
TCTCTTTCTGATCCAGGCCCCTCAGTCCAAGCTTTGGAAAACCTTCACCTCATCTTAAAC 
CAAACTCAAATATATTTATTTTTTTACCAT 

SEQ ID NO: 24_SGK324_H 

GCCGCGATGGCCAGCACCAGGAGTATCGAGCTGGAGCACTTTGAGGAACGGGACAAAAGG 

CCGCGGCCGGGGTCGCGGAGAGGGGCCCCCAGCTCCTCCGGGGGCAGCAGCAGCTCGGGC 

CCCAAGGGGAACGGGCTCATCCCCAGTCCGGCGCACAGTGCCCACTGCAGCTTCTACCGC 

ACGCGGACCCTGCTIGGCCCTCAGCTCGGAGAAGAAGGCCAAGAAGGCGCGCTTCTACCGG 

AACGGGGACCGCTACTTCAAGGGCCTGGTGTTTGCCATCTCCAGCGACeGCTTCCGGTCC 

TTCGATGCGCTCCTCATAGAGCTCACCCGCTCCCTGTCGGACAACGTGAACCTGCCCCAG 

GGTGTCCGCACTATCTACACCATCGACGGCAGCCGGAAGGTCACCAGCCTGGACGAGCTG 

CTGGAAGGTGAGAGTTACGTGTGTGCATCCZAATGAACCATTTCGTAAAGTCGATTACACC 

AAAAATATTAATCCAAACTGGTCTGTGAACATCAAGGGTGGGACATCCCGAGCGCTGGCT 

GCTGCCTCCTCTGTGAAAAGTGAAGTAAAAGAAAGTAAAGATTTCATCAAACCCAAGTTA 

GTGACTGTGATTCGAAGTGGAGTGAAGCCTAGAAAAGCCGTGCGGATCCTTCTGAATAAA 

AAGACTGCTCATTCCTTTGAACAAGTCTTAACAGATATCACCGAAGCCATTAAACNAGC^ 

TCAGGAGTCGTCAAGAGGCTCTGCT^CCCTGGATGGAAAGCAGGTGAGAGTTACGTGTGTG 

CATCTGCCAGACTTTTTTGGTGATGACGATGTTTTTATTGCATGTGGACCAGAAAAATTT 

CGTTATGCCCAAGATGACTTTGTCCTGGATCATAGTGAATGTCGTGTCCTGAAGTCATCT 

TATTCTCGATCCTCAGCTGTTAAGTATTCTGGATCCAAAAGCCCTGGGCCCTCTCGACGC 

AGCCAGATTTCTGCTCATGGCAGATCTTCTTCCAATGTAAACGGTGGACCTGAGCTTGAC 

CGTTGCATAAGTCCTGAAGGTGTGAATGGAAACAGATGCTCTGAATCATCAACTCTTCTT 

GAGAAATACAAAATTGGAAAGGTCATTGGTGATGGCAATTTTGCAGTAGTC^ 

ATAGACAGGTCCACTGGAAAGGAGTTTGCCCTAAAGATTATAGACAAAGCa^TG^ 

GGAAAGGAACACCTGATTGAGAATGAAGTGTCAATACTGCGCCGAGTGAAACATCCCAAT 

ATCTVTTATGCTGGTCGAGGAGATGGAAACT^GCT^CTGAGCTCTTTCTGGTGATGGAATTG 

GTCAAAGGTGGAGATCTCTTTGATGCAATTACTTCGTCGACCAAGTACACTGAGAGAGAT 

GGCAGTGCCATGGTGTACAACTTAGCCAATGCCCrCAGGTATCTCCATGGCCTCA^ 

GTGCACAGAGACATCAAACCAGAGAATCTCTTGGTGTGTGAATATCCTGATGGAACCAAG 

TCTTTGAAACTGGGAGACTTTGGGCTTGCGACTGTGGTAGAAGGCCCTTTATACACAGTC 

TGTGGCACACCCACTTATGTGGCTCCARAAATCATTGCTGAAACTGGCTATGGCCTGAAG 

GTGGACATTTGGGCAGCTGGTGTGATCACATACATACTTCTCTGTGGATTCCCACCATTC 

CGAAGTGAGAACAATCTCCAGGAAGATCTCTTCGACCAGATCTTGGCTGGGAAGCTGGAG 

TTTCCGGCCCCCTACTGGGATAACATCACGGACTCTGCCAAGGAATTAATCAGTCAAATG 

CTTCAGGTAAATGTTGAAGCTCGGTGTACCGCGGGACAAATCCTGAGTCACCCCTGGGTG 

TCAGATGATGCCTCCCAGGAGAATAACATGCAAGCTGAGGTGACAGGTAAACTAAAACAG 

CACTTTAATAATGCGCTCCCCAAACAGAACAGCACTACCACCGGGGTCTCCGTCATCATG 
GTGAGTGG7U!^GGCGGCAGGTCTGGCCTGACTGCGGAGCCGGCCTTGAAGTTTTTGAATTA 

GGTAGCCGGGAGCTGCCCTCACATGGAAGTTGGTGCCTTCCGTAGTCCTATTTCATATGA 

AGATTGGCTTGGCATGTGGAGGGCACTCATTCGGCAACTCCCAGGCTTTGGGCACTGTGT 

GGAGGGGCTTGTGTAGGGACCAGCAGGCCTGGTGTGAGGGGTCCAGGCGTCAAGGAGCTC 

CTGGCTGGGCCCTCTGGGCAGCTGCTTCCACTCTTGTCTCTGCCTTCTCATCTAGAGAGA 

CTCCCAAGCCCTGGAGGGGTGTGTTGTGTTAGGAATTAACTCCCTGCCTACCCCAAGGCC 

TCAGAAATAGATTATTAGAGATGTGAATTATTCTTTGAGACTTGGGATAAGAAACAGCCA 

AAGCTAAACATATTTCAGTTTTAAAAAATCAGTGTTTTATAAAACACAGTTTGGGGCTTT 

TAAAGGTACATAATCAAGGAAAAAAATATATATTCATTTTTCAGGGTTGGTAACATTTTA 

TGAGATGTCAGTGACAACGATGGCCTTATTTTTTTCAGCCTTTTCTTCTTCCAAAATGTT 

TCTTAAGGCAACTCTCCTAAATACATAAACACAACAAATTAAAATGAAAAGTGACA 

AGTAAATGAATCT^AAAGGAAAAAACATTGAACCAGAGGTGAGGGCAGCACACCCGCT^GCA 

GCTGTCCAGGCCTGAGCCAATGCAACCCTGGGCGGGAAGGCCAGCTCACCGTGAGCAGGT 

AGAAGCCAGCCAGCCACCCAGGCAGGGACCTTGGTTCTCCCCACACACTCCCAGGAGCAG 

GGAACAGGGGTGGAGTGGCCTTTCCCAGAGCTGGAGTTGGCTGCAGCAGCTTTCGAATCA 

GACCTGCCAAGGTGATGGGCGTCTGAGTTTCACATCTGGGCCCCCCGTGACCCCACTGAG 

TCCTGACy^GCTAAGGATGGGCCACCTCCACAGCTCCGTCACTCGTACTTGGGACAGGCCT 

CTCATCCTCTGGGAAGGTCCTCCTTGTTTCCTACCCAACTAGAAGGGAAACAGTGGGATA 

TTCTCATGGTACATGGTTGTCTGAAAGCCTTACCTAGGAAGACGCAGGGTCTAGATAGAA 

GCTATAAGGAAGCCACACACATAACCCACATCCCCACACCCCCAA^ 

CCACACCCCCCACACCCCCCACATCCCCACCATAATTACCCCCACCTCCAAATATCTCAT 

SEQ ID NO: 25_W30246_M SGK324_M 

ACCAAGTCCTCCAGCTCCTCTCCAACCAGCCCGGGAAGTTTCAGAGGATTGAAGATTTCT 

GCTCAGGGCyiGATCTTCTTCCAACGTAAACGGTGGGCCTGAACTTGACCGTTGCCTG^ 

CCTGAAGGTGTGAATGGAAACCGGTGCTCCGAGTCGTTCCCCCTTCTGGAGAAATACAGA 

ATAGGGAAGGTCATCGGGGACGGCAACTTCGCGGTAGTTAAGGAGTGCGTGGACAGGTAC 

ACTGGAAAAGAGTTTGCATTAAAGATTATAGACAAAGCCAAATGCTGTOGTVAAG^ 

CTGATTGAGAACGAAGTGTCT^TCCTGCGCCGAGTGAAGCACCCCAACATCTITCATGTTG 

GTTGAAGAGATGGAAACAGCAACTGACCTCTTTCTAGTGATGGAACTGGTCAAAGGTGGA 

GATCTCTTTGATGCGATTACCTCTTCAACCAAGTACACTGAGAGAGATGGAAGCGCCATG 

GTGTACAACCTAGCCAATGCCCTCCGGTACCTGCACAGCCTCAGCATCGTCCACAGGGAC 

ATCAAGCCTGAGAATCTGCTGGTGTGCGAATACCCAGATGGAACCAAGTCTTTGAAGCTG 

GGAGACTTTGGGCTGGCGACGGTGGTTGAAGGCCCGTTGTACACGGTCTGTGGCACGCCA 

ACTTATGTGGCACCAGAGATCATAGCTGAAACAGGTTATGGCCTGAAGGTGGATGTTTGG 

GCAGCTGGTGTGATTACATACATACTTCTCTGTGGATTCCCACCATTCCGGAGTGAGAAC 

AATCTCCAGGAAGATCTCTTTGAC(2AGATCTTGGCTGGAAAGCTGGAATTCCCAGCCCCC 

TACTGGGACAACATTAC7VGACTCTCCTTGTGTGTGTTTTAGGAAATGCTTATGAAGCTGG 

CCCGTGGGCTTCCCAGTGGGACGTGCAGCAGTTCTTGGCAGAGCAGGGCCAGCTCTGCTG 

TGTCATCTCCAGGGTCTCCCATCACCTCTGCTCTTTGCCATGGCAGGTCTGCTGAGACCC 

CGCGGGGACGGGGGCATGGTGCTCCCTGATTGGCCTGTGACCAACCTTCTGGAAGGCTGC 

TGGCAGTTTTCCCTGTTTTCCACCACCCCACTCTTTTTAATAATTGTATATAACTGTACT 

TGTTCTACTTGCTTGTCTTTAAAACAGGGGCCCCCACAGTTCACTCTCACTGTTAGATTT 

TGCCTTTTCCAGGTATCCCCAACCTGCAATAAACTCTTCCCTCTTCAG 

SEQ ID NO: 26_AA383293_H 

CCAGCAGCCAAGAGGGTAGTGGTGTACCGGAATGGGGACCCATTCTTCCCAGGCTCCCAG 
CTGGTGGTGACTCAACGCCGCTTCCCCACCATGGAGGCCTTCCTCTGCGAGGTGACATCA 
GCTGTGCAGGCCCCACTGGCTGTGCGTGCCCTCTACACACCTTGTCATGGCCACCCTGTC 
ACCAACCTGGCAGACTTGAAGAACAGAGGGCAGTATGTGGCCGCTGGATTTGAACGATTC 
CACAAGCTCCCCCCTTACCAGGCTTTTTGTCTCAGTGTGTTCAGGAATGGGGACCTGGTA 

AGTCCCCGATTTAGTCTGAAGCTGTCCCAGGCTGCCAGCCAGGACTGGGAAACTGTGTTG 

AAGCTCCTGACTGAGAAGGTCAAGTTGCAGAGTGGGGCTGTGAGACTCTGCACCCTAGAG 

GGGCTCCCACTGTCAGCAGGGAAGGAGCTGGTAACTGGCCATTACTATGTGGCTGTCGGA 

GAGGATGAGTTCAAGGACCTTCCCTATCCAGCTCTGTCCACAAGAGGGCTCCTGGCAGCA 

GGCAATGAAGCCCACCTGAGGAGTGGAGTGGGGACTGTCGCTGGTTCCCCCAAGCCTCTT 

GGAAGGAAGGCTAAGAAGGAGACATGCCTAATCGTGACCCTGACCCTGAAATACCAGCAG 

TCAGAAACAAGCAGAGACGGGCAATCATTCCCATCAGGAGTTATAGGAGTATATGGAGCT 

CCCCACCGAAGGAAGGAGACAGCGGGGGCCCTGGAAGTAGCAGATGATGAAGACACTCAG 

ACAGAGGAGCCCTTGGATCAGAGGGCAGCACAGATAGTGGAACAGGTTACTTGTCTGCAA 

GACTTTTTTGGTGATGACGATGTTTTTATTGCATGTGGACCAGAAAAATTTCGTTATGCC 

CAAGATGACTTTGTCCTGGATCATAGTCGTCGACGGCTCCTGAGAGAGCACCAGG 

TTTGAGAAGCTCCGCAGGACCCGAGGAGAAGAGAAGGAGGCAGAGAAGGAGAAAAAGCCA 

TGTATGTCTGGAGGCAGAAGGATGACTCTCAGAGATGACCAACCTGCAAAGCTAGJ^^ 

GAGCCCAAGACGAGGCCAGAAGAGAACAAGCCAGAGCGGCCCAGCGGTCGGAAGCCACGG 

CCCATGGGCATCATTGCCGCCAATGTGGAAAAGCATTATGAGACTGGCCGGGTCATTGGG 

GATGGGAACTTTGCTGTCGTGAAGGAGTGCAGACACCGCGAGACCAGGCAGGCCTATGCG 

ATGAAGATCATTGACAAGTCCAGACTCAAGGGCAAGGAGGACATGGTGGACAGTGAGATC 

TTGATCATCCAGAGCCTCTCTCACCCCAACATCGTGAAATTGCATGAAGTCTACGAAACA 

GACATGGAAATCTACCTGATCCTGGAGTACGTGCAGGGAGGAGACCTTTTTGACGCCATC 

ATAGAAAGTGTGAAGTTCCCGGAGCCCGATGCTGCCCTCATGATCATGGACTTATGCAAA 

GCCCTCGTCCACATGCACGACAAGAGCATTGTCCACCGGGACCTCAAGCCGGAAAACCTT 

TTGGTTCAGCGAAATGAGGACAAATCTACTACCTTGAAATTGGCTGATTTTGGACTTGCA 

AAGCATGTGGTGAGACCTATATTTACTGTGTGTGGGACCCCAACTTACGTAGCTCCCGAA 

ATTCTTTCTGAGAAAGGTTATGGACTGGAGGTGGACATGTGGGCTGCTGGCGTGATCCTC 

TATATCCTGCTGTGTGGCTTTCCCCCATTCCGCAGCCCTGAXXGAGGGGACCAGGACGAG 

CTCTTTAACATCATCCAGCTGGGCCACTTTGAGTTCCTCCCCCCTTACTGGGACAATATC 

TCTGATGCTGCTAAAGATCTGGTGAGCCGGTTGCTGGTGGTAGACCCCAAAAAGCGCTAC 

ACAGCTCTITCAGGTTCTTCAGCACCCCTGGATCGAAACAGCTGGCAAGACCAATACAGTG 

AAACGACAGAAGCAGGTGTCCCCCAGCAGCGATGGTCACTTCCGGAGCCAGCACAAGAGG 

GTTGTGGAGCAGGTATCATAGTCACCACCTTGGGAATCTGTCCAGCCCCCAGTTCTGCTC 

AAGGACAGAGAAAAGGATAGAAGTTTGAGAGAAAAACAATGAAAGAGGCTTCTTCACATA 

ATTGGTGAATCAGAGGGAGAGACACTGAGTATATTTTAAAGCATATTAAAAAAATTAAGT 

CAATGTTAAATGTCACAACATATTTTTAGATTTGTATATTTAAAGCCTTTAATACATTTT 

TGGGGGGTAAGCATTGTCATCAGTGAGGAATTTTGGTAATAATGATGTGTTTTGCTTCCC 

CTTTGTAACCAAGTTTATTCTGTACTACAGGAGTGGTGCTTACCAGGGTCTAT^CTCCCC 

CTGTGAGATTAATAAGGTGCATTG 

SEQ ID NO: 28_AA197883_M 

ATGCCAACCGCGCCGGTCCTGCGCCCGCCGCCGCCGCCAGCGACCCCCGCCCCGCCGGCA 
CCCAGTCGCCCTGCGCCTCCCATTCCGGGCCACCGAGGCCCATGTGACCATTCTCTGAAA 
TGCTTAAGCTCGAAGATCTCTGAGAGAAAGCTGCCAGGCCCCTGGTTACCTGCGGGACGA 
GGACCTCTGGAGAAGCCAGTTCTGGGGCCACGTGGTGCCGTCATGCCGCTGTTCAGCCCT 
CAGAGCAGCCTCCACTCAGTCCGCGCAGAGCACAGCCCACTGAAGCCCAGGGTGGTGACG 
GTGGTGAAGCTGGGTGGGCAGCCCCTCCGTAAGGCCACCCTGCTCCTCAACCGGCGCTCA 
GTGCAGACCTTTGAGCAGCTCCTATCAGACATCTCCGAAGCCTTGGGCTTCCCACGCTGG 
AAGAACGACCGTGTGCGGAAGCTGTTCACCCTCAAGGGCAGGGAGGTGAAGAGTGTGTCT 
GACTTCTTCCGGGAGGGTGATGCTTTCATAGCTATGGGCAAAGAGCCGCTGACATTGAAG 
AGTATCCAGTTGGCCATGGAGGAGCTGTATCCTAAGAACCGGGCTCTTGCCCTGGCCCCT 
CACAGTAGAGTCCCCTCCCCAAGGCTGAGAAGCAGACTTCCCAGCAAGCTTCTGAAAGGA 
AGTCACCGCTGTGGGGAGGCAGGAAGCTATAGCGCGGAAATGGAGAGTAAGGCAGTCTCT 

AGGCATCAGGGCAAGACTTCCACAGTGCTGGCCCCAGAAGACAAGGCGAGGGCCCAGAAG 

TGGGTAAGAGGGAAACAGGAGTCAGAACCTGGTGGCCCGCCTTCACCCGGGGCAGCCACT 

CAGGAGGAGACTCATGCAAGTGGAGAGAAACATCTGGGGGTGGAGATCGAAAAGACCTCC 

GGGGAGATTGTCAGATGTGAGAAGTGTAAGAGAGAAAGAGAGCTGCAGTTGGGCCTGCAG 

AGGGAGCCGTGCCCGCTGGGAACCAGTGAGCTGGACCTGGGGAGAGCTCAGAAGAGGGAT 

TCCGAGAAGTTGGTGAGGACCAAGAGCTGCAGGAGGCCTTCTAAGGCAAAATTTACAGAT 

GGAGAGGAAGGGTGGAAGGGTGACAGCCATCGGGGCAGTCCCAGGGACCCCCCTCAGGAA 

ATGAGGAGGCCCAACAGCAACTCAGACAAGAAAGAGATCAGAGGCTCAGAAAGTCAGGAC 

AGTTATCCTCAGGGGGCACCCAAGGCCCAGAAGGACTTCGTGGAAGGGCCACCAGCTGTA 

GAGGAGGGGCCGATAGACATGAGGAGAGAGGACCGGCACACATGCAGGAGCAAGCATGCC 

GCCTGGCTCCGGAGAGAGCAGCAGGCCGAACCCCCACAGCTCCCCAGAACCCGAGGGGAG 

GAGAAGCAAGCAGAGCACGAGAAGAAGCCAGGCGGCTTAGGAGAGAGGAGGGCGCCAGAG 

AAGGAGTCTAAGAGGAAGCTAGAAGAGAAGAGGCCAGAACGACCCAGTGGCCGGAAGCCG 

AGGCCCAAGGGCATCATCTCAGCGGATGTGGAGAAGCACTATGACATAGGTGGGGTCATT 

GGGGATGGCAACTTTGCCACCGTGAAGGAATGCAGGCACCGAGAGACCAAGCAGGCTTAC 

GCCATGAAGATGATTGACAAGTCCCAGCTGAAGGGTAAGGAGGACATTGTCGACAGTGAG 

ATTTTAATCATCCAGAGTCTCTCTCATCCCAACATTGTGAAACTGCATGAGGTCTACGAA 

ACGGAGGCGGAGATCTACCTGATCATGGAGTATGTGCAGGGAGGGGACCTTTTTGATGCC 

ATCGTTGAAAATGTGAAGTTTCCAGAGCCCGAGGCTGCAGTTATGATCACAGACTTGTGT 

AAGGCCTTCGTCCACATGCACGACAAGAATATCGTCCTVCCGGGACGTGAAACCAGAA^ 

CTCCTGGTTCAGCGAAATGAAGACAAGTCTATCACCTTGAAGCTGGCTGATTTTGGCTTG 

GCCAAATATGTGGTGAGGCCTATATTTACTGTGTGTGGGACGCCAACATATGTAGCTCCT 

GAAATTCTTTCTGAGAAAGGTTACGGCCTGGAGGTGGACATGTGGGCGGCAGGTGTGATC 

CTATACATCCTCTTGTGTGGCTTCCCCCCTTTCCGAAGTCCTGAGAGGGACCAAGACGAG 

CTCTTCT^CATCATCCAAGTGGGCCAGTTTGAGTTCCTCTCTCCTTACTGGGACAACATT 

TCTGATGCTGCCAAAGATCTGGTGAGAAATTTGCTGGAGGTGGACCCTAAGAAGCGGTAC 

ACGGCCGAACAGGTCCTACAGCATCCCTGGATTGAGATGGTTGGGCATACCAACACAGGG 

AACTCACAGAAGGAGGAGTCCCCCAACAGTTTAGGTCACTTCCAGAGTCAGCACAAGAAG 

GTTGCAGAGCAGATGCCATAA 

SEQ ID NO: 29_DRAK2_H 

CTCCGCTGCTGTCGCCAGGAGTCACTTCACGAGAAGCCAGGTCACAACCGTCGGCCCTTG 
TCTGGAAAAGTAAAAGTGGATCCTGCCACGTTCGGAGCTCCCTGGCGCCTCGCCCGGCTG 
GAGCTAGAGAACTCGTCCTGTGGCGGCCCCCGGCGTGGGGCGGGACAGCGGCCCCCTGGA 
GGGGGCAGTCCCGGGAGAACCTGCGGCGGCCGGAGCGGTAAAAATAAGTGACTAAAGAAG 
CAGACCTGGGAATCACCTAACTITGTCGAGGAGGAGATTTGATTGCCGAAGTATTTCAGGC 
CTACTAACTACAACTCCTCAAATTCCAATAAAAATGGAAAACTTTAATAATTTCTATATA 
CTTACT^TCTAAAGAGCTAGGGAGAGGAAAATTTGCTGTGGTTAGACAATGTATATCAAA^ 
TCTACTGGCCAAGAATATGCTGCAAAATTTCTAAAAAAGAGAAGAAGAGGACAGGATTGT 
CGGGCAGAAATTTTACACGAGATTGCTGTGCTTGAATTGGCAAAGTCTTGTCCCCGTGTT 
ATTAATCTTCATGAGGTCTATGAAAATACAAGTGAAATCATTTTGATATTGGAATATGCT 
GCAGGTGGAGAAATTTTCAGCCTGTGTTTACCTGAGTTGGCTGAAATGGTTTCTGAAAAT 
GATGTTATCAGACTCATTAAACAAATACTTGAAGGAGTTTATTATCTACATCAGAATAAC 
ATTGTACACCTTGATTTAAAGCCACAGAATATATTACTGAGCAGCATATACCCTCTCGGG 
GACATTAAAATAGTAGATTTTGGAATGTCTCGAAAAATAGGGCATGCGTGTGAACTTCGG 
GAAATCATGGGAACACCAGAATATTTAGCTCCAGAAATCCTGAACTATGATCCCATTACC 
ACAGCAACAGATATGTGGAATATTGGTATAATAGCATATATGTTGTT7UVCTCACACATCA 
CCATTTGTGGGAGAAGATAATCAAGAAACATACCTCAATATTTCTCAAGTTAATGTAGAT 
TATTCGGAAGAAACTTTTTCATCAGTTTCACAGCTGGCCACAGACTTTATTCAGAGCCTT 
TTAGTAAAAAATCCAGAGAAAAGACCAACAGCAGAGATATGCCTTTCTCATTCTTGGCTA 
CAGCAGTGGGACTTTGAAAACTTGTTTCACCCTGAAGAAACTTCCAGTTCCTCTCAAACT 
CAGGATCATTCTGTAAGGTCCTCTGAAGACAAGACTTCTAAATCCTCCTGTAATGGAACC 
TGTGGTGATAGAGAAGACAAAGAGAATATCCCAGAGGATAGCAGCATGGTTTCCAAAAGA 
TTTCGTTTCGATGACTCATTACCCAATCCCCATGAACTTGTTTCAGATTTGCTCTGTTAG 
CACTTTTTTCTTTGACTCATTTGGACTGAATTTGAAATTTTATATCCACTCCAGTGAGAT 
TATGATTTGTAGCTTCATATATGACT^TGTTTATATTGTAAATGCACTTTTCCATGGAATA 
ATTTAGGGAAGTGTTTTAATGTTAAATTACTAGTTGCTAGCATGTTATGATTTCATATCC 
TGAGATAGCTCTGCAGATAAGAAAATATTT/^TATATGACAAAAAGTAAAATTGTACAT 

GTGAAAG 

SEQ ID NO: 30_W44160_M DRAK2_M 

CCAGACGCGGCTGCACTTTTCAAACCTCAACTGTAAGAAGCGTCGGTCAGCGTCTGTGCG 

GTCGCCGCCGGGAGTCGCCTCACAGGGGCCTGGCTGACGGCGACCAGCCGTTGTGGGGAA 

GAGTGCGAGGTAAAAGTCTGCCTAGAGAAGCAGGTCTGGCAGTCATGAACATGTCTCGGA 

GGAGATTCGATTGCCGAAGTGTCTCAGGCTTGCTAACTACAACCCCTCAAACGCCGATTA 

AAACAGAGAATTTTAATAATTTCTATACTCTTACCCCAAAAGAACTTGGGAGAGGAAAAT 

TTGCTGTGGTTAGACAATGTATATCAAAATCAACTGGACAAGAGTATGCTGCCAAATCCC 

TGAAAAAGAGGAGAAGAGGGCAGGATTGCCGGGCGGAAATTCTGCATGAGATAGCTGTGC 

TGGAGCTGGCCAGGTCTTGTCCCCACGTGATTAATCTGCATGAGGTCTACGAAAATGCAA 

CGGAAATCATTTTGGTGTTAGAATATGCTGCGGGTGGAGAAATTTTCAACCTGTGTTTAC 

CTGAGTTAGCCGAAATGGTATCTGAAAATGATGTTATCAGACTCATTAAACAAATCCTTG 

AAGGAGTTCATTATCTACATCAGAATAACATTGTTCACCTTGATTTAAAGCCTVCAG^ 

TACTTTTGAGCAGTATATACCCACTCGGGGACATAAAAATTGTAGATTTTGGAATGTCTC 

GAAAT^TTGGGAATGCAAGTGAGCTTCGGGAAATCATGGGAACACCTGAATACTTAGCTC 

CAGAAATCCTCAACTATGATCCCATTACCACT^GCAACAGATATGTGGAATATTG 

TAGCGTATATGTTGTTAACTCATACATCACCATTTGTAGGAGAAGATAATCAAGAAACAT 

ATCTGAATATTTCTCAAGTGAATGTAGATTATTCAGAAGAAATGTTTTCATCAGTTTCAC 

AGCTGGCCACAGACTTCATCCAGAGCCTTCTAGTAAAGAACCCAGAGAAAAGACCAACAG 

CAGAATCCTGCCTATCCCACTCATGGCTGCAGCAGTGGGACTTTGGAAGCTTGTTTCATC 

CTGAGGAAACTTCAGGCTCCTCTCAAATTCAGGATCTGACTCTCAGGTCCTCTGAAGAGA 

AGACCTCCAAGTCCTCCTGTAATGGGAGCTGTGGAGCCCGGGAGGACAAGGAGAACATCC 

CTGAAGATGGCAGCTTAGTTTCTAAAAGATTTCGATTCGATGACTCCTTGCCCAGCCCCC 

ATGAACTTGTTCCAGATTTGTTCTGTTAGCATTTTTCTCTGTGACTCATCTGGACTGACT 

CGGAAATTTGAAATCTCTGGTGTGAGATTGTGTTTGTAGCTTCATATATTATGTTTATAT 

TATAAATGCACTTCTGCTTAGAAGAACTTAAGGAACAGTTTAAATGCTAGGCTTCTGTTG 

GCTAGCATATCATTTCTTGTCCTGAAATTGTTTTGCAGAGGAAAATATTTAAGTATATGA 

CAAAAAATGTAAATTGTGTTTAAGAGAACACATGCAACTGAAAGAACTCAAGTTCAGTCA 

GACTTATAAAATGGGTTATATTATGGTTAGTAAAAGTTGAAAAAAAATGAAAACAGGAAT 

TTAGTAGGTTCTAAGGTAAGCCCTATACCATAACTCTATTACAGAGAATCTGTTTGGGGA 

AATGCTGTCAAGGGTAAACCTVCAACATATACTGCTTTATAAATACTCCAGAGAGAGTTTA 

TAGTTGAAAGTATTTCCCAGTTACCAATAATAGCTTGAAACTGTAAGATTTTCTTTGTGT 

GCCATGTGCTCGGTGAGAGGACACAGTCAACCAGAGCAGGGTTGATCCAGGCTGTTTCTC 

TGCAAACCGAGTCAAAACTCGACATCATTTCCAGCTCATGTATTTTGTACGTGC^^ 

TATCAGATCTAATAAGATCTGGAAGATGGATATGC7UUVTAAGAGGCCTTTGTCTTCTAGA 

ATGATTAGAGTAGAGGAGAATTGGATAGTACAGAATATGCTCTAGTTTCAGTCAGACATA 

TTCATAAAGGGAAATGTTAAGTTCTGGCAGCTGACTTAGTGTTGGATGTCTCCTAAGTCT 

CAGGATAGAAGCCCATCATTAGAGCATAGGCACTTCAGGAATTCTTGTGTGAAATTCTAG 

TGAGTGAGGAGGTGTGACATGCAGCTATCTTTGGGCTCCTTTTGTGTGTGTTCTGCTGGA 

CACAACAC7VTGGGAGTGTTCAGTGTTGTCCGTGGTCAATATCTATGTTCAGTCCTGATGG 
GAGGGGCCTAGGGACTGCTTTGGAGATTTCCCACTGGTGTCCATTTTAAGGTCTGTAATA 
ATGTCATGTTAAGATAACAGATCTCATAAATATGCTACTCTATCAGACTCCGTTGCCAAA 
ACAAATTAAAAGCCTGTGTATTGAAGTGGGTGTTAGTCTAACAACCTGTAAATTCTTGAA 
ATTGTTACTAAAATTCCAAATTCTTTAGATAACTTTAAACTATTTAAATTGAGCATTGCT 
GTCTTTGTTTGATTAAAGGTTGAGTTCCTTTATATCTGTTATTTTTAAAGGAAAAGTTGT 
TTGCCTTTTGTATATGTGTGTGCATATGTGTATGTGTACAGGTATATGTATATATGTATT 
GATAGATAAAATACAGCCTTTAAACAACTTC 

SEQ ID NO: 31_H01248_H, DRAK1_H 

ATGATCCCTTTGGAGAAGCCAGGCAGCGGCGGCTCCTCCCCAGGCGCCACCTCAGGCTCG 
GGCCGGGCAGGCCGGGGTCTGAGCGGGCCGTGCCGGCCGCCGCCGCCGCCCCAGGCCCGC 
GGGCTGCTGACAGAGATACGCGCCGTGGTGCGCACCGAGCCCTTCCAGGACGGCTACAGC 
CTGTGCCCGGGCCGGGAGCTGGGCAGGGGGAAATTTGCAGTGGTGAGAAAATGTATAAAG 
AAAGATTCTGGGAAAGAATTTGCTGCAAAGTTCATGAGAAAAAGAAGAAAAGGCCAAGAT 
TGTCGGATGGAAATAATTCATGAGATTGCTGTACTTGAACTAGCACAAGACAATCCTTGG 
GTCATTAATTTACATGAAGTTTATGAGACTGCATCAGAAATGATCTTAGTTCTGGAATAT 
GCTGCTGGGGGTGAAATCTTTGACCyVGTGTGTTGCAGACAGAGAAGAAGCCTTTAAAGAA 
AAAGATGTTCAAAGACTTATGCGACAGATTTTAGAAGGTGTTCACTTTTTACACACTCGT 
GATGTAGTTCATCTTGATTTGAAGCCTCAGAATATTCTGTTGACAAGTGAATCTCCATTG 
GGTGACATTAAGATTGTTGATTTTGGCCTTTCAAGAATATTGAAGAACAGTGAAGAGCTC 
CGAGAAATTATGGGTACCCCTGAATATGTGGCTCCTGAAATTCTTAGTTATGATCCTATA 
AGCATGGCAACAGATATGTGGAGCATTGGAGTGTTAACATATGTCATGCTTACAGGAATA 
TCACCTTTCTTAGGCAATGATAAACAAGAAACATTCTTAAACATCTCACAGATGAATTTA 
AGTTATTCTGAGGAAGAATTTGATGTTTTGTCTGAGTCGGCTGTTGATTTCATCAGGACA 
CTTTTAGTTAAGAAACCTGAAGATCGAGCCACTGCTGAAGAATGTCTAAAGCACCCCTGG 
TTGACAC71GAGCAGTATTCAAGAGCCTTCTTTCAGGATGGAAAAGGC7VCTAGAAGAAGCA 
AATGCCCTCCAAGAAGGTCATTCTGTGCCTGAAATTAATTCGGATACCGACAAATCAGAA 
ACCGAGGAATCCATTGTAACCGAAGAGTTAATTGTAGTTACTTCATATACTCTAGGACAA 
TGCAGACAGTCTGAAAAAGAGAAAATGGAGCAAAAGGCCATTTCCAAACGATTTAAATTT 
GAGGAACCTTTGCTACAAGAAATTCCAGGAGAATTTATCTACTGA 

SEQ ID NO: 32_AA021445_H 

CGGGGCTGCCGGGGCCGGGACTGGGGGAGCCGGGCCCGCGGGCCGCCTGCTGCCTCCGCC 

CGCGCCGGGGTCCCCAGCCGCCCCCGCTGCCGTGTCCCCTGCGGCCGGCCAGCCGCGTCC 

CCCAGCCCCGGCCTCCCGCGGACCCATGCCCGCCCGTATCGGCTACTACGAGATCGACCG 

CACCT^TCGGCAAGGGCAACTTCGCGGTGGTCAAGCGGGCaiCGCACCTCGTCT^CC^ 

CAAGGTTGCTATCAAGATCATAGATAAGACCCAGCTGGATGAAGAAAACTTGAAGAAGAT 

TTTCCGGGAAGTTCAAATTATGAAGATGCTTTGCCACCCCCATATCATCAGGCTCTACCA 

GGTTATGGAGACAGAACGGATGATTTATCTGGTGACAGAATATGCTAGTGGAGGGGAAAT 

ATTTGACCACCTGGTGGCCCATGGTAGAATGGCAGAAAAGGAGGCACGTCGGAAGTTCAA 

ACAGATCGTCACAGCTGTCTATTTTTGTCACTGTCGGAACATTGTTCATCGTGATTTAAA 

AGCTGAAAATTTACTTCTGGATGCCAATCTGAATATCAAAATAGCAGATTTTGGTTTCAG 

TAACCTGTTCACTCCTGGGCAGCTGCTGAAGACCTGGTGTGGCAGCCCTCCCTATGCTGC 

ACCTGAACTCTTTGAAGGAAAAGAATATGATGGGCCCAAAGTGGACATCTGGAGCCTTGG 

AGTTGTCCTCTACGTGCTTGTGTGCGGTGCCCTGCCATTTGATGGAAGCACACTGCAGAA 

TCTGCGGGCCCGCGTGCTGAGTGGAAAGTTCCGCATCCCATTTTTTATGTCCACAGAATG 

TGAGCATTTGATCCGCCATATGTTGGTGTTAGATCCCAATAAGCGCCTCTCCATGGAGCA 

GATCTGCAAGCACAAGTGGATGAAGCTAGGGGACGCCGATCCCAACTTTGACAGGTTAAT 

AGCTGAATGCCAACAACTAAAGGAAGAAAGACAGGTGGACCCCCTGAATGAGGATGTCCT 

CTTGGCCATGGAGGACATGGGACTGGACAAAGAACAGACACTGCAGTCATTAAGATCAGA 
TGCCTATGATCACTATAGTGCAATCTACAGCCTGCTGTGTGATCGACATAAGAGACATAA 

AACCCTGCGTCTCGGAGCACTTCCTAGCATGCCCCGAGCCCTGGCCTTTCAAGCACCAGT 

CAATATCCAGGCGGAGCAGGCAGGTACTGCTATGAACATCAGCGTTCCCCAGGTGCAGCT 

GATCT^CCCAGAGAACCAAATTGTGGAGCCGGATGGGACACTGAATTTGGACAGTGATGA 

GGGTGAAGAGCCTTCCCCTGAAGCATTGGTGCGCTATTTGTCAATGAGGAGGCACACAGT 

GGGTGTGGCTGACCCACGCACGGAAGTTATGGAAGATCTGCAGAAGCTCCTACCTGGCTT 

TCCTGGAGTCAACCCCCAGGCTCCATTCCTGCAGGTGGCCCCTAATGTGAACTTCATGCA. 

CAACCTGTTGCCTATGCTUW^CTTGCAACCAACCGGGCAACTTGAGTACAAGGAGCAGTC 

TCTCCTACAGCCGCCCACGCTACAGCTGTTGAATGGAATGGGCCCCCTTGGCCGGAGGGC 

ATCAGATGGAGGAGCCAACATCCAACTGCATGCCCAGCAGCTGCTGAAGCGCCCACGGGG 

ACCCTCTCCGCTTGTCACCATGACACCAGCAGTGCCAGCAGTTACCCCTGTGGACGAGGA 

GAGCTCAGACGGGGAGCCAGACCAGGAAGCTGTGCAGAGGTACTTGGCAAATAGGTCCAA 

AAGACATACACTGGCCATGACCAACCCTACAGCTGAGATCCCACCGGACCTACAACGGCA 

GCTAGGACAGCAGCCTTTCCGTTCCCGGGTCTGGCCTCCTCACCTGGTACCTGATCAGCA 

TCGCTCTACCTACAAGGACTCCAACACTCTGCACCTCCCTACGGAGCGTTTCTCCCCTGT 

GCGCCGGTTCTCAGATGGGGCTGCGAGCATCCAGGCCTTCAAAGCTCACCTGGAAAAAAT 

GGGCAACAACAGCAGCATCT^CT^GCTGCAGCAGGAGTGTGAGCAGCTGCAG 

CGGGGGGCAGATTGATGAAAGAACCCTGGAGAAGACCCAGCAGCAGCATATGTTATACCA 

GCTVGGAGCAGCACCATCAAATTCTCCTVGCAACAAATTCAAGACTCTATCT^ 

GCCATCTCCACCTCTTCAGGCTGCATGTGAAAATCAGCCAGGCCTCCTTACCCATCAGCT 

CCTVGAGGTTAAGGATTCAGCCTTCAAGCCCACCCCCCAACCACCCCAACT^ 

CAGGCAGCCCAGTAATAGTCCTCCCCCCATGAGCAGTGCCATGATCCAGCCTCACGGGGC 

TGCATCITCTTCCCAGTTTCAAGGCTTACCTTCCCGCAGTGCAATCTTTCT^GCAGC^ 

TGAGAACTGTTCCTCTCCTCCCAACGTGGCACTAACCTGCTTGGGTATGCAGCAGCCTGC 

TCAGTCACAGCAGGTCACCATCCAAGTCCAAGAGCCTGTTGACATGCTCAGCAACATGCC 

AGGCACAGCTGCAGGCTCCAGTGGGCGCGGCATCTCCATCAGCCCCAGTGCTGGTCAGAT 

GCAGATGCAGCACCGTACCAACCTGATGGCCACCCTCAGCTATGGGCACCGTCCCTTGTC 

CAAGCAGCTGAGTGCTGACAGTGCAGAGGCTCACAGCTTGAACGTGAATCGGTTCTCCCC 

TGCTAACTACGACCAGGCGCATTTACACCCCCATCTGTTTTCGGACCAGTCCCGGGGTTC 

CCCCAGCAGCTACAGCCCTTCAACAGGAGTGGGGTTCTCTCCAACCCAAGCCCTGAAAGT 

CCCTCCACTTGACCAATTCCCCACCTTCCCTCCCAGTGCACATCAGCAGCCGCCACACTA 

TACCACGTCGGCACTACAGCAGGCCCTGCTGTCTCCCACGCCGCCAGACTATACAAGACA 

CCAGCAGGTACCCCACATCCTTCAAGGACTGCTTTCTCCCCGGCATTCGCTCACCGGCCA 

CTCGGACT^TCCGGCTGCCCCCT^a^GAGTTTGCACAGCTCATTAAAAGGCAGCAGCy^ 

ACGGCAGCAGCAGCAGC7U\CAGCAGCAACAGCAAGAATACCAGGAACTGTTCAGGCACAT 

GAACCAAGGGGATGCGGGGAGTCTGGCTCCCAGCCTTGGGGGACAGAGCATGACAGAGCG 

CCAGGCTTTATCTTATC7VAAATGCTGACTCTTATCACCATCACACCAGCCCCCAGCATCT 

GCTACAAATCAGGGCACAAGAATGTGTCTCACAGGCTTCCTCACCCACCCCGCCCCACGG 

GTATGCTCACCAGCCGGCACTGATGCATTCAGAGAGCATGGAGGAGGACTGCTCGTGTGA 

GGGGGCCAAGGATGGCTTCCAAGACAGTAAGAGTTCAAGTACTITTGACCAAAGGTTGCCA 

TGACAGCCCTCTGCTCTTGAGTACCGGTGGACCTGGGGACCCTGAATCTTTGCTAGGAAC 

TGTGAGTCATGCCCAAGAATTGGGGATACATCCCTATGGTCATCAGCCAACTGCTGCATT 

CAGTAAAAATAAGGTGCCCAGCAGAGAGCCTGTCATAGGGAACTGCATGGATAGAAGTTC 

TCCAGGACAAGCAGTGGAGCTGCCGGATCACAATGGGCTCGGGTACCCAGCACGCCCCTC 

CGTCCATGAGCACCACAGGCCCCGGGCCCTCCAGAGACACCACACGATCCAGAACAGCGA 

CGATGCTTATGTACAGCTGGATAACTTGCCAGGAATGAGTCTCGTGGCTGGGAAAGCACT 

TAGCTCTGCCCGGATGTCGGATGCAGTTCTCAGTCAGTCTTCGCTCATGGGCAGCCAGCA 

GTTTCAGGATGGGGAAAATGAGGAATGTGGGGCAAGCCTGGGAGGTCATGAGCACCCAGA 

CCTGAGTGATGGCAGCCAGCATTTAAACTCCTCTTGCTATCCATCTACGTGTATTACAGA 

CATTCTGCTCAGCTACAAGCACCCCGAAGTCTCCTTCAGCATGGAGCAGGCAGGCGTGTA 
ACAAGAAACAGAGAGTTTTGTGTACyVGCTTGGGAATGAAAAGGTTGATTGTAAACCCACA 
GTATCTAGCAGCGTTGTGCCAAATTGCCCTTGTGTTTCTCTCCACCCAAAATATCACAGC 
TGCTTTCCTCACATTTGGTTCATCCGTGTGCTGTTCTTTTGGGTTCTGAGAGGGTTTTGC 
CATGTTTGCTTGTATGACCAAGTCACCAAGGAAATAAACAGGAAGGAAATCCATGTTCTC 
C 

SEQ ID NO: 33_2R22-5-ll_H 

CTGGGCCGCTGCCGGTCAGGTCGGCCGCCCCTGACAGCTCCGGGAGCCTCAAGCGCGACA 

GGGCGCCCTCACCTCGGGACATCCACACACCGACCGCTCCTGCTCCAGAGGCT^CAACCC 

AGCGCGCCTAGCCTGGCGCCGTGCAGCGAAGCCCAAGAGCTGGCCTCGCCACGAAGGTTG 

AACCAGCCAAATTTTCGAGACAGCTCACGGCTTAGAGGAAGGTTCATCTAAATAAAGGC^ 

GGCTAAAGTGACATTGCAGGGATTAAATCCTTCTTTGGCTGCCTGTGTGACCAGAAGGCT 

TATTTGCAAGTTTCTTCTTTCCr]XX3GGTCC7lGATTATTAGGTCTCCAGCGCCCTGCAGCT 

TGACAGAAAGAGAAGCTITGAAATGAAGGTCAGAGATGAGATCCCGCAGCAGGGACGTGGG 

GGCCTCCCAGGGGCATTTACGCACCAGAGTGCAAGATTCTCTGGCCATCAAGGGAAATAG 

CAAACAGAAGCCTTTGTCCTGGGGCACAGCCACCTACCACAAAGCATCAGACTCCACGT^ 

TGGCCAGAAAGTTCCTGGAGTCCCATCAGGCCAGTGGGTATGTAACATGTGCCTAATTGT 

ACAGCTAGAGCCTGCAAGTTCAACGTGAGGGAAGGTGGGAAATGTCTTGAGTGAGGCGAG 

CAGCTCCTGGCTGGGCTGGGCAGACTCAGCTACCACGTTCACTGCCTTCCTCTCACTAAA 

GCCGAGAGGGAGGCTGCTCAGCTCTCAGGAAAACTCTTTTGAACCCTGGGCACCTGCTGT 

CCTCAGTTGGCATCTCCCACCCTCTGAGCCTCTTCTGCTCCTGCACAACCTGCCTCTTCG 

CTGAGATGGAGACGTGAGCCCCCGTGGACGATGACTGCAGTGTATATGAATGGAGGTGGC 

CTGGTGAACCCCCACTATGCCCGGTGGGATCGGCGCGACAGTGTAGAAAGTGGCTGTCAG 

ACCGAGAGTAGCAAGGAGGGTGAGGAGGGACAGCCCCGCCAGCTGACGCCCTTCGAGAAA 

CTGACACAGGACATGTCCCAGGATGAGAAGGTGGTGAGGGAGATCACGCTGGGGAAACGG 

ATAGGCTTCTACCGAATTCGAGGGGAAATCGGAAGTGGAAACTTCTCCCAAGTGAAGCTT 

GGGATTCACTCCCTAACCAAAGAAAAGGTGGCCATTAAGATCCTGGACAAGACCAAGTTA 

GACCAGAAAACCCAGAGGCTACTATCCCGAGT^TCTCCAGCATGGAAAAGCTGCACCAT 

CCCAACATCATCCGCCTTTACGAAGTGGTGGAGACCCTATCCAAGCTGCACTTGGTGATG 

GAGTATGCAGGGGGTGGGGAGCTCTTCGGAAAAATTAGCACTGAGGGGAAGCTCTCTGAA 

CCAGAAAGCAAGCTCATCTTCTCCCAGATTGTGTCTGCCGTGAAGCACATGCATGAAAAC 

CAAATTATTCATAGAGATCTGAAAGCAGAAAATGTATTCTATACCAGTAATACTTGTGTG 

AAGGTGGGCGATTTTGGATTCT^GCACAGTAAGCAAAAAAGGTGAAATGCTGAAC^ 

TGTGGGTCTCCTCCCTACGCTGCGCCTGAACTCTTCCGGGACGAGCACTACATCGGCATT 

TACGTGGATATCTGGGCCTTGGGGGTGCTTTTGTACTTCATGGTGACTGGCACCATGCCA 

TTTCGGGCAGAAACCGTGGCCAAACTAAAAAAGAGCATCCTCGAGGGCACATACAGTGTA 

CCGCCGCACGTGTCAGAGCCCTGCCACCGACTCATCCGAGGAGTCCTTCAGCAGATCCCC 

ACGGAGAGGTACGGAATCGACTGCATCATGAATGATGAATGGATGCAAGGGGTGCCATAC 

CCTACACCTTTGGAACCTTTCCAACTGGATCCCAAACATTTGTCGGAAACCAGCACTCTC 

AAGGAAGAAGAAAATGAGGTCAAAAGCACTTTAGAACATTTGGGCATTACAGAAGAGCAT 

ATTCGAAATAACCAAGGGAGAGATGCTCGCAGCTCAATCACAGGGGTCTATAGAATTATT 

TTACATAGAGTCCAAAGGAAGAAGGCTTTGGAAAGTGTCCCAGTCATGATGCTACCAGAC 

CCTAAAGAAAGAGACCTCAAAAAAGGGTCCCGTGTCTACAGAGGGATAAGACACACATCC 

AAATTTTGCTCGATTTTATAAATTGCACTAGACTGCTTGTAACTAACCAAGATGATTGTT 

GCTGCTTCTAAATTTTTTTCAAGGACAACTTGAGTGGAGACATTTTTGTAATTTTTAAAT 

AAACTTAAATTTGAGATATGCAAAAAAAAAA 

SEQ ID NO: 34_R31237_1_H, AAC33487 

ATGTCCACTAGGACCCCATTGCCAACGGTGAATGAACGAGACACTGAAAACCACACGTCA 
CATGGAGATGGGCGTCAAGAAGTTACCTCTCGTACCAGCCGCTCAGGAGCTCGGTGTAGA 
AACTCTATAGCCTCCTGTGCAGATGAACAACCTCACATCGGAAACTACAGACTGTTGAAA 
ACAATCGGCAAGGGGAATTTTGCAAAAGTAAAATTGGCAAGACATATCCTTACAGGCA^^ 
GAGGTTGCAATAAAAATAATTGACAAAACTCAGTTGAATCCTU^CTAGTCTACAAAAGCTC 
TTCAGAGAAGTAAGAATAATGAAGATTTTAAATCATCCCAATATAGTGAAGTTATTCGAA 
GTCATTGAAACTGAAAAAACACTCTACCTAATCATGGAATATGCAAGTGGAGGTGAAGTA 
TTTGACTATTTGGTTGCACATGGCAGGATGAAGGAAAAAGAAGCAAGATCTAAATTTAGA 
CAGATTGTGTCTGCAGTTCAATACTGCCATCAGAAACGGATCGTACATCGAGACCTCAAG 
GCTGAAAATCTATTGTTAGATGCCGATATGAACATTAAAATAGCAGATTTCGGTTTTAGC 
AATGAATTTACTGTTGGCGGTAAACTCGACACGTTTTGTGGCAGTCCTCCATACGCAGCA 
CCTGAGCTCTTCCAGGGCAAGAAATATGACGGGGCAGAAGTGGATGTGTGGAGTCTGGGG 
GTCATTTTATACACACTAGTCAGTGGCTCACTTCCCTTTGATGGGCAAAACCTAAAGGAA 
CTGAGAGAGAGAGTATTAAGAGGGAAATACAGAATTCCCTTCTACATGTCTACAGACTGT 
GAAAACCTTCTCAAACGTTTCCTGGTGCTAAATCCAATTAAACGCGGCACTCTAGAG^^ 
ATCATGAAGGACAGGTGGATCAATGCAGGGCATGAAGAAGATGAACTCAAACCATTTGTT 
GAACCAGAGCTAGACATCTCAGACCAAAAAAGAATAGATATTATGGTGGGAATGGGATAT 
TCACAAGAAGAAATTCAAGAATCTCTTAGTAAGATGAAATACGATGAAATCACAGCTACA 
TATTTGTTATTGGGGAGAAAATCTTCAGAGCTGGATGCTAGTGATTCCAGTTCTAGCAGC 
AATCTTTCACTTGCTAAGGTTAGGCCGAGCAGTGATCTCAACAACAGTACTGGCCAGTCT 
CCTCACCT^CAAAGTGCAGAGAAGTGTTTCTTCAAGCCAAAAGCAAAGACGCTACAGTGAC 
CATGCTGGACCAGCTATTCCTTCTGTTGTGGCGTATCCGAAAAGGAGTCAGACAAGCACT 
GCAGATGGTGACCTCAAAGAAGATGGAATTTCCTCCCGGAAATCAAGTGGCAGTGCTGTT 
GGAGGAAAGGGAATTGCTCCAGCCAGTCCCATGCTTGGGAATGCAAGTAATCCTAATAAG 
GCGGATATTCCTGAACGCAAGAAAAGCTCCACTGTCCCTAGTAGTAACACAGCATCTGGT 
GGAATGACACGACGAAATACTTATGTTTGCAGTGAGAGAACTACAGCTGATAGACACTCA 
GTGATTCAGAATGGCAAAGAAAACAGCACTATTCCTGATCAGAGAACTCCAGTTGCTTCA 
ACACACAGTATCAGTAGTGCAGCCACCCCAGATCGAATCCGCTTCCCAAGAGGCACTGCC 
AGTCGTAGCACTTTCCACGGCCAGCCCCGGGAACGGCGAACCGCAACATATAATGGCCCT 
CCTGCCTCTCCCAGCCTGTCCCATGAAGCCACACCATTGTCCCAGACTCGAAGCCGAGGC 
TCCACTAATCTCTTTAGTAAATTAACTTCAAAACTCACAAGGAGTCGCAATGTATCTGCT 
GAGCAAAAAGATGAAAACAAAGAAGCAAAGCCTCGATCCCTACGCTTCACCTGGAGCATG 
AAAACCACTAGTTCAATGGATCCGGGGGACATGATGCGGGAAATCCGCAAAGTGTTGGAC 
GCCAATAACTGCGACTATGAGCAGAGGGAGCGCTTCTTGCTCTTCTGCGTCCACGGAGAT 
GGGCACGCGGAGAACCTCGTGCAGTGGGAAATGGAAGTGTGCAAGCTGCCAAGACTGTCT 
CTGAACGGGGTCCGGTTTAAGCGGATATCGGGGACATCCATAGCCTTCAAAAATATTGCT 
TCCAAAATTGCCAATGAGCTAAAGCTGTAA 

SEQ ID NO: 35_W90839_M 

AAAGGGCCGTCCTGGTCCAGCCGTTCCCTGGGTGCCCGTTGCCGGAACTCTATCGCTTCC 
TGCCCTGAGGAACAACCCCATGTGGGCAACTATAGGCTGCTAAGGACCATCGGGAAGGGC 
AACTTCGCCAAAGTCAAGCTGGCTCGGCATATCCTCACGGGCCGGGAGGTCGCTATTAAG 
ATCATTGATAAGACCCAGCTGAACCCCAGTAGCTTGCAGAAGCTGTTCAGAGAAGTCCGA 
ATTATGAAGGGACTCAACCACCCCAACATCGTGAAGCTTTTTGAGGTGATAGAGACGGAG 
AAGACGCTATACCTGGTGATGGAATACGCTAGCGCAGGAGAAGTGTTTGACTACCTCGTG 
TCGCACGGCCGCATGAAGGAGAAGGAGGCTCGAGCCAAGTTCCGGCAGATCGTGTCAGCC 
GTGCACTACTGTCATCAGAAGAACATTGTACACAGGGATCTAAAGGCTGAAAACCTGTTG 
CTGGATGCCGAGGCCAACATCAAAATCGCCGACTTCGGCTTCAGCAATGAGTTCACGCTG 
GGCTCCAAGCTGGACACCTTCTGTGGGAGCCCCCCATACGCCGCCCCAGAGCTGTTCCAG 
GGCAAGAAGTATGATGGGCCAGAGGTGGACATCTGGAGCCTGGGTGTCATCCTGTACACG 
CTGGTCAGCGGCTCCCTGCCCTTCGATGGGCACAACCTCAAGGAGCTGCGGGAGCGAGTC 
CTCAGAGGAAAGTACCGGGTCCCCTTCTACATGTCTACAGACTGCGAGAGCATTCTGCGG 




AGATTTCTGGTGCTGAACCCCGCAAAACGCTGTACTCTGGAGCAAATCATGAAAGACAAA 

TGGATCAACATCGGCTATGAGGGTGAGGAGCTGT^GCCAGACACGGAGCTCAAAGAAGAG 

CGGATGCCGGGTCGGAAAGCGAGCTGCAGTGCAGTGGGCAGTGGAAGTCGAGGCTTGCCC 

CCCTCCAGCCCCATGGTCAGCAGTGCCCACAACCCCAATAAGGCT^GAGATCCC 

CGGAAGGACAGCACTAGCACCCCTAACAACCTCCCCCCCAGCATGATGACCCGAAGAAAC 

ACCTATGTGTGCACAGAGCGACCAGGATCTGAACGCCCGTCCTTGTTGCCAAATGGCAAA 

GA?^TAGCTCCGGTACCTCGCGGGTGCCCCCTGCCTCGCCTTCCAGTCATAGCCTGGCT 

CCCCCGTCAGGCGAGCGGAGCCGCCTGGCTCGGGGCTCCACCATCCGCAGaVCCTTCCAT 

GGGGGCCAGGTCCGAGACCGGCGGGCAGGGAGCGGGAGTGGCGGGGGTGTGCAGAATGGA 

CCCCCAGCCTCACCCACGCTTGCCCACGAGGCCGCACCCCTGCCCTCCGGGCGGCCTCGC 

CCCACCACCAACCTCTTCACCAAGCTGACCTCCAAACTGACCCGAAGGGTCACAGACGAA 

CCTGAGAGAATCGGGGGACCTGAGGTCACAAGTTGCCATCTACCTTGGGATAAAACGGA^ 

ACCGCCCCCAGGCTGCTCCGATTCCCCTGGAGTGTGAAGCTGACCAGCTCGCGACCTTCC 

TGAGGCCCTGATGGCTGCCCTGCGACAGGCCACA 

SEQ ID NO: 36_406786.5_H 

GTAGCCGGCTTGGCGTGACCGTCGCCTGATCCAGTTGTTAGAGGTGGAAGCTTGGCAGTT 

GGCCTCCCTTCTTCCCATGGAGGTCGGGGGCTTAACAGTCTTTGAAGAGGACCAGAGATG 

CCTTTCCCAGAGCCTCCCCTTGCCAGTGTCAGCAGAGGGCCCAGCTGCACAGACCACTGC 

TGAGCCCAGCAGGTCGTTTTCCTCTIGCCCACAGACACCTGAGCAGAAGGAATGGGCTTTC 

CAGACTCTGCCAGAGCAGGACGGCGCTCTCTGAAGACAGATGGAGCTCCTATTGTCTATC 

ATCACTGGCTGCCCAGAATATTTGTACAAGTAAACTGCACTGCCCTGCTGCCCCTGAGCA 

CACGGACCCGTCCGAACCGCGGGGCAGTGTGTCCTGCTGCTCCCTGCTGCGGGGACTGTC 

CTCAGGGTGGTCCTCACCTCTGCTTCCGGCCCCTGTGTGCAACCCTAAC7UVGGCCATCTT 

CACGGTGGATGCCAAQACCACAGAGATCCTCGTTGCTAACGACAAAGCTTGCGGGCTCCT 

GGGGTACAGCAGCCAGGACCTGATTGGCCAGAAGCTCACGCAGTTCTTTCTGAGGTC^ 

TTCTGATGTGGTGGAGGCCCTCAGCGAGGAGCACATGGAGGCCGACGGCCACGCTGCGGT 

GGTGTTTGGCACGGTGGTGGACATCATCT^CCCGTAGTGGGGAGAAGATTCCA 

GTGGATGAAGAGGATGCGGCAGGAGCGCCGCCTATGCTGCGTGGTGGTCCTGGAGCCCGT 

GGAGAGGGTCTCGACCTGGGTCGCTTTCCAGAGCGATGGCACCATCACGTCATGTGACAG 

TCTCTTTGCTCATCTTCACGGGTACGTGTCTGGGGAGGACGTGGCTGGGCAGCATATCAC 

AGACCTGATCCCTTCTGTGCAGCTCCCTCCTTCTGGCCAGCACATCCCAAAGAATCTCAA 

GATTCAGAGGTCTGTTGGAAGAGCCAGGGACGGTACCACCTTCCCTCTGAGCTTAAAGCT 

GAAATCCCAACCCAGCAGCGAGGAGGCGACCACCGGTGAGGCGGCCCCTGTGAGCGGCTA 

CCGGGCATCTGTCTGGGTGTTCTGCACCATCAGTGGCCTCATCACCCTCCTGCCGGATGG 

GACCATCCACGGCATCAACCACAGCTTCGCGCTGACACTGTTTGGTTACGGAAAGACGGA 

GCTCCTGGGCAAGAATATCACTTTCCTGATTCCTGGTTTCTACAGCTACATGGACCTTGC 

GTACAACAGCTCATTACAGCTCCCAGACCTGGCCAGCTGCCTGGACGTCGGCTUVTG^ 

TGGGTGTGGGGAGAGAACCTTGGACCCGTGGCAGGGCCAGGACCCAGCTGAGGGGGGCCA 

GGATCCAAGGATTAATGTCGTGCTTGCTGGTGGCCACGTTGTGCCCCGAGATGAGATCCG 

GAAGCTGATGGAAAGCCAAGACATCTTCACCGGGACTCAGACTGAGCTGATTGCTGGAGG 

CCAGCTCCTTTCCTGCCTCTCACCTCAGCCTGCTCC7VGGGGTGGACAATGTCCCAGAAGG 

AAGCCTGCCAGTGCTICGGTGAACAGGCGCTGCCCAAGGACCAGCAAATCACTGCCT^ 

GAGAGAGGAACCTGTGGCAATAGAGAGCCCCGQACAGGATCTTCTGGGAGAAAGCAGGTC 

TGAACCAGTGGATGTGAAGCCATTTGCTTCCTGCGAAGATTCTGAAGCTCCAGTCCCAGC 

TGAGGATGGGGGCAGTGATGCTGGCATGTGTGGCCTGTGTCAGAAGGCCCAGCTAGAGCG 

GATGGGAGTCAGTGGTCCCAGCGGTTCAGACCTTTGGGCTGGGGCTGCCGTGGCCAAGCC 

CCAGGCCAAGGGTCAGCTGGCGGGGGGCAGCCTCCTGATGCACTGCCCTTGCTATGGGAG 

TGAATGGGGCTTGTGGTGGCGAAGCCAGGACTTGGCCCCCAGCCCCTCTGGGATGGCAGG 

CCTCTCGTTTGGGACACCTACTCTAGATGAGCCGTGGCTGGGAGTGGAAAACGACCGAGA 
AGAGCTGCAGACCTGCTTGATTAAGGAGCAGCTGTCCCAGTTGAGCCTTGCAGGAGCCCT 
GGATGTCCCCCACGCCGAACTCGTTCCGACAGAGTGCCAGGCTGTCACCGCTCCTGTGTC 
GTCCTGCGATCTGGGAGGCAGAGACCTGTGCGGTGGCTGCACGGGGAGCTCCTCAGCCTG 
CTATGCCTTGGCCACGGACCTCCCTGGGGGCCTGGAAGCAGTGGAGGCCCAGGAGGTTGA 
TGTGAATTCGTTTTCCTGGAACCTCAAGGAACTCTTTTTCAGTGACCAGACAGACCAAAC 
GTCATCAAATTGTTCCTGTGCTACGTCTGAACTCAGAGAGACACCCTCTTCCTTGGCAGT 
GGGCTCCGATCCAGATGTAGGCAGTCTCCAGGAACAGGGGTCGTGTGTCCTGGATGACAG 
GGAGCTGTTACTACTGACCGGCACCTGTGTTGACCTTGGCCAAGGCCGACGGTTCCGGGA 
GAGCTGTGTGGGACATGATCCAACAGAACCGCTTGAGGTTTGTTTGGTGTCCTCTGAGCA 
TTATGCAGC?y^GCGACAGAGAAAGCCCAGGACACGTTCCTTCCACGTTGGATGCTGGCCC 
TGAGGACACGTGCCCATCAGCAGAGGAGCCAAGGCTGAACGTCCAGGTCACCTCCACGCC 
CGTGATCGTGATGCGCGGGGCTGCTGGCCTGCAGCGGGAGATCCAGQAGGGTGCCTACTC 
CGGGAGCTGCTACC7VTCGAGATGGCTTACGGCTGAGTATACAGTTTGAGGTGAGGCGGGT 
GGAGCTCCAGGGCCCCACACCTCTGTTCTGCTGCTGGCTGGTGAAAGACCTCCTCCACAG 
CCT^ACGCGACTCAGCCGCCAGGACCCGCCTGTTCCTTGCCAGCCTGCCCGGCTCCACCCTV 
CTCTACCGCTGCTGAGCTCACCGGACCCAGCCTGGTGGAAGTGCTCAGAGCCAGACCCTG 
GTTTGAGQAGCCCCCCAAGGCTGTGGAACTGGAGGGGTTGGCGGCCTGTGAGGGCGAGTA 
CTCCCAAAAGTACAGTACCATGAGCCCGCTGGGCAGTGGGGCCTTCGGCTTCGTGTGGAC 
TGCTGTGGACAAGGGAAAAAACAAGGAGGTGGTGGTGAAGTTTATTAAGAAGGAGAAGGT 
CTTGGAGGATTGTTGGATTGAGGATCCCAAACTTGGGAAAGTTACTTTAGAGATCGCAAT 
TCTATCCAGGGTGGAGCACGCCAATATCATCAAGGTATTGGATATATTTGAAAACCAAGG 
GTTCTTCCAGCTTGTGATGGAGAAGCACGGCTCCGGCCTAGACCTCTTCGCTTTCATCGA 
CCGCCT^CCCCAGGCTGGATGAGCCCCTGGCGAGCTACATCTTCCGACAAGTGAGAGCAGG 
CCAGAGCCGTCTAGTGTCAGCAGTGGGATACCTGCGCTTGAAGGACATCATCCACCGTGA 
CATCAAGGATGAGAACT^TCGTGATCGCTGAGGACTTCAC^^TCAAGCTGATAGACTTTGG 
CTCGGCCGCCTACTTGGAAAGGGGAAAATTATTTTATACTTTTTGTGGGACCATCGAGTA 

CTGTGCACCGGAAGTTCTCATGGGGAATCCCTACAGAGGGCCGGAGCTGGAGATGTGGTC 
TCTGGGAGTCACTCTGTACACGCTGGTCTTTGAGGAGAACCCCTTCTGTGAGCTGGAGGA 
GACCGTGGAGGCTGCCATACACCCGCCATACCTGGTGTCCAAAGAACTCATGAGCCTTGT 
GTCTGGGCTGCTGCAGCCAGTCCCTGAGAGACGCACCACCTTGGAGAAGCTGGTGACAGA 
CCCGTGGGTAACACAGCCTGTGAATCTTGCTGACTATACATGGGAAGAGGTGTTTCGAGT 
AAACAAGCCAGAAAGTGGAGTTCTGTCCGCTGCGAGCCTGGAGATGGGGAACAGGAGCCT 
GAGTGATGTGGCCCAGGCTCAGGAGCTTTGTGGGGGCCCCGTTCCAGGCGAGGCTCCTAA 
TGGCCAAGGCTGTTTGCATCCCGGGGATCCCCGTCTGCTGACCAGCTAAACACCAATTTC 
TTCCTGCTTTTCTCCACTTGGTTTGGAAAATCACACj^GTTTTCAGGCTCCATCTGTTTG 

SEQ ID NO: 37_AA544838_M, 406786_M 

CCACGCGTCCGCATCCCTGCTTGGATGAGCCCCTGGCGAGTTTCATCTTTCGACAACTAG 
TGTCTGCTGTAGGATACCTGCACTCCCAAGGCATCATCCATAGAGACATCAAGGATGAGA 
ACATTGTGATTGCTGAGGACTTCACAATTAAGCTGATAGATTTTGGCTCAGCTGCCTACT 
TAGAGAGGGGCAAACTATTTTATACCTTTTGTGGAACAATCGAATACTGTGCACCTGAGG 
TTCTCATTGGAAATCCCTACAGAGGGCCAGAGCTGGAGATGTGGTCTCTGGGGGTCACCC 
TGTACACGCTCATCTTCGAGGAGAATCCCTTCTGTGAGGTGGAGGAGACCATGGAGGCAG 
TTATTCATCCCCCATTCCTGGTTTCCCAAGAACTTATGAGTCTTCTGTCTGGACTGCTGC 
AGCCTTGCCCTGAGCAGCGGACCACTTTGGAGAAGCTGATCAGGGACCCCTGGGTGACAC 
AGCCTGTGAACCTTGCTAGCTATACTTGGGAAGAGGTGTGTAGGACCAACCAGCCAGAAA 
GTGGCCTGCTGTCAGCTGCAAGTCTGGAGATTGGGAGTAGGAGTCCAAGTGAAATGGCTC 
AGAGAGAGGGTCTCTGTGGGCCTCCTGCTCCCAGGGAGACTCGTGGTGACCAGCACTGCT 
TGCATCTTAAGGACCCCTCTTTGCCAGTCAGCTGAGCAAGCTCTCCTGCTCTTTGGTTTG 
GGCAGTTGTATGGATTTCAGGGCTTTCTACCTGGAGAAAGGAAGTTGTGAAGGATTGGGA 
TGACTTCTGCTTCTAGATTCCTATGCAAATGCTACAAGAGCCTGCGATGCTAGTTTTCTT 
AGGTTTATGATATAGACTTGTAATTCATGTTTTTTTATAACCTTGAAAATCATTCTAATG 
TTCAGTTATACTGTACTATTAAAGGGCTTTAAGTTGTAAGCCTCAGAAAGACACAAGGAG 
TGTTTAAGTTCTCTATTTTTGTTGTTTGTTTTTGCTTGTAAGTTTTTGAGACAGGATCTC 
ACCATGTAACTTTGGCTGGCCTGGAACTCAACTATGTAGACCAGGTAGACCTTAAACTGA 
CAGATCTGCCTGCGCTTGCCTCCCT^GCATTAGGACTGATGGTGTGTGTCACCATGCCCA 
GTTCTTCCTGGTTTTGTGTGTAGGTTTCTTTCCCACTGACTTGGTACATGTGACATGTGA 
CAGATGTATGGAGTCTATAGAAGTGGCCAGACAAAAATGGCCAGAATATTTATTTATTTT 
CTTAAAAATTTTCCAAATTAAAGCTACTTAGTTAACAGTTAAACTGGCCAGGACTATATG 
AGATAAACTTGGTTTTCTATTTCTTTTTGT 

SEQ ID NO: 38_AA785735_H 

GGCACGAGGCGCGCCTGGCTGGGCCCTGCGGAGGANGGGAAGGAGCGAAGGAGCGAAGGA 

GCAAGCGGAGCGCAGTTCGCCCAAGCCAAGCCGCGCTGCCAACCCTCCCGCCCGCCCGCG 

CTCCTGTCCGCCGTGTCTAGCAGCGGGGCCCAGCATGGTCATGGCGGATGGCCCGAGGCA 

CTTGCAGCGCGGGCCGGTCCGGGTGGGGTTCTACGACATCGAGGGCACGCTGGGCAAGGG 

CAACTTCGCTGTGGTGAAGCTGGGGCGGCACCGGATCACCAAGACGGAGGTGGCAATAAA 

AATAATCGATAAGTCTCAGCTGGATGCAGTGAACCTTGAGAAAATCTACCGAGAAGTACA 

AATAATGAAAATGTTAGACCTVCCCTCTVCATAATCAAACTTTATCT^GGTAATGG 

AAGTATGTTGTACCTTGTGACAGAATATGCCAAAAATGGAGAAATTTTTGACTATCTTGC 

TAATCATGGCCGGTTAAATGAGTCTGAAGCCAGGCGAAAATTCTGGCAAATCCTGTCTGC 

TGTTGATTATTGTCATGGTCGGAAGATTGTGCACCGTGACCTCAAAGCTGAAAATCTCCT 

GCTGGATAACAACATGAATATCAAAATAGCAGATTTCGGTTTTGGAAATTTCTTTAA^ 

TGGTGAACTGCTGGCAACATGGTGTGGCAGCCCCCCTTATGCAGCCCCAGAAGTCTTTGA 

AGGGCAGCAGTATGAAGGACCACAGCTGGACATCTGGAGTATGGGAGTTGTTCTTTATGT 

CCTTGTCTGTGGAGCTCTGCCCTTTGATGGACCGACTCTTCCAATTTTGAGGCAGAGGGT 

TCTGGAAGGAAGATTCCGGATTCCGTATTTCATGTCAGAAGATTGCGAGCACCTTATCCG 

AAGGATGTTGGTCCTAGACCCATCCAAACGGCTAACCATAGCCCAAATCAAGGAGCATAA 

ATGGATGCTCATAGAAGTTCCTGTCCAGAGACCTGTTCTCTATCCACAAGAGCAAGAAAA 

TGAGCCATCCATCGGGGAGTTTAATGAGCAGGTTCTGCGACTGATGCACAGCCTTGGAAT 

AGATCAGCAGAAARCCATTGAGTCTTTGCAGAACAAGAGCTATAACCACTTTGCTGCCAT 

TTATTTCTTGTTGGTGGAGCGCCTGAAATCACATCGGAGCAGTTTCCCAGTGGAGCAGAG 

ACTTGATGGCCGCCAGCGTCGGCCTAGCACCATTGCTGAGCAAACAGTTGCCAAGGC^ 

GACTGTGGGGCTCCCAGTGACCATGCATTCACCGAACATGAGGCTGCTGCGATCTGCCCT 

CCTCCCCCAGGCATCCAACGTGGAGGCCTTTTCATTTCCAGCATCTGGCTGTCAGGCGGA 

AGCTGCATTCATGGAAGAAGAGTGTGTGGACACTCCAAAGGTCAATGGCTGTCTGCTTGA 

CCCTGTGCCTCCTGTCCTGGTGCGGAAGGGATGCCAGTCACTGCCCAGCAACATGATGGA 

GACCTCCATTGACGAAGGGCTGGAGACAGAAGGAGAGGCCGAGGAAGACCCCGCTCATGC 

CTTTGAGGCATTTCAGTCCACACGCAGCGGGCAGAGACGGCACACTCTGTCAGAAGTGAC 

OUITCAACTGGTCGTGATGCCTGGGGCAGGGAAAATTTTCTCCATGAATGACAGCCCCTC 

CCTTGACAGTGTGGACTCTGAGTATGATATGGGGTCTGTTCAGAGGGACCTGAACTTTCT 

GGAAGACAACCCTTCCCTTAAGGACATCATGTTAGCCAATCAGCCTTCACCCCGCATGAC 

ATCTCCCTTCATAAGCCTGAGACCTACCAACCCAGCCATGCAGGCTCTGAGCTCCCAGAA 

ACGAGAGGTCCACAACAGGTCTCCAGTGAGCTTCAGAGAGGGCCGCAGAGCATCAGATAC 

CTCCCTCACCCAGGGAATTGTAGCATTTAGACAACATCTTCyVGAATCTGGCTAGAACCA^ 

AGGAATTCTAGAGTTGAACAAAGTGCAGTTGTTGTATGAACAAATAGGACCGGAGGCAGA 

CCCTAACCTGGCGCCGGCGGCTCCTCAGCTCCAGGACCTTGCTAGCAGCTGCCCTCAGGA 

AGAAGTTTCTCAGCAGCAGGAT^GCGTCTCCACTCTCCCTGCCAGCGTGCATCCCCAGCT 

GTCCCCACGGCAGAGCCTGGAGACCCAGTACCTGCAGCACAGACTCCAGAAGCCCAGCCT 

TCTGTCAAAGGCCCAGAACACCTGTCAGCTTTATTGCAAAGAACCACCGCGGAGCCTTGA 
GCAGCAGCTGCAGGAACATAGGCTCCAGCAGAAGCGACTCTTTCTTCAGAAGCAGTCTCA 

ACTGCAGGCCTATTTTAATCAGATGCAGATAGCAGAGAGCTCCTACCCACAGCCAAGTCA 

GCAGCTGCCCCTTCCCCGCCAGGAGACTCCACCGCCTTCTCAGCAGGCCCCACCGTTCAG 

CCTGACCCAGCCCCTGAGCCCCGTCCTGGAGCCTTCCTCCGAGCAGATGCAATACAGCCC 

TTTCCTCAGCCAGTACCAAGAGATGCAGCTTCAGCCCCTGCCCTCCACTTCCGGTCCCCG 

GGCTGCTCCTCCTCTGCCCACGCAGCTACAGCAGCAGCAGCCGCCACCGCCACCACCCCC 

TCCACCACCACGACAGCCAGGAGCTGCCCCAGCCCCCTTACAGTTCTCCTATCAGACTTG 

TGAGCTGCCAAGCGCTGCTTCCCCTGCGCCAGACTATCCCACTCCCTGTCAGTATCCTGT 

GGATGGAGCCCAGCAGAGCGACCTAACGGGGCCAGACTGTCCCAGAAGCCCAGGACTGCA 

AGAGGCCCCCTCCAGCTACGACCCACTAGCCCTCTCTGAGCTACCTGGACTCTTTGATTG 

TGAAATGCTAGACGCTGTGGATCCACAACACAACGGGTATGTCCTGGTGAATTAGTCTCA 

GCACAGGAATTGAGGTGGGTCT^GGTGAAGGAAGAGTGTATGTTCCTATTTTTATTCCAGC 

CTTTTAAATTTAAAGCTTATTTTCTTGCCCTCTCCCTAACGGGGAGAAATCGAGCCACCC 

AACTGGAATCAGAGGGTCTGGCTGGGGTGGATGTTGCTTCCTCCTGGTTCTGCCCCACCA 

CAAAGTTTTCTGTGGCAAGTGCTGGAACATAGTTGTAGGCTGAGGCAGGAGAATGGCGTG 

AACCCGGGAGGCGGAGCTTGCAGTGAGCCAAGATCGTGCCACTGCACTCCAGCCTGGGCG 

ACTGAGCAAGACTCCACCTCTUW^AAAAAAAAAAAGGACT^GAGCAGTA^ 

TGTTTCTAAACTGGACAAAGAGATTTTCTTAAAGTTTCTATCATCTCCCTTCTGACAGGT 

TCTACAGTGTGGTCTGAAGCACCTGTAATGTCAGAGCCCTTGTCTGGCCCTTGGTGGCAG 

GTGAACGAAAGCAGTGGAGCCTCTCACCTTCCAGTAGCCTCTCACATTCTTATTTTACCA 

TTTTTGTCCTAATTAAGGTAGCCTAGCTGATTCTAGAAGACAGCCATCCTACGTGCACCC 

CCACCTTGTGTCCACATCTTCTCCAGGCAGGTTTCAACCTATCAGCAGACTCAGGCACAC 

ACTGGGGCTICTIGATAGAGAACCAGGCGGCAGCAGTGCTCGCAGACCCT^CCCAGGGAGAG^ 

TGTGATGGGTTCTGCCCAGATACTCTGCTCGCCCACCCACAAGGGAGCAATAGCTTATAT 

TTGTACATTAGTTTTACCAAGCACTTTCTCTTCTAACCCTCACT^CAATTCTATGAAATT 

AGCTGGGGAGATACTGTCCTTATTITTCACAGCTGAAGAAACCTUVAGCTTTGGGAAGTTT 

GTGACTTCTCTGAGATCACAGCTGGTGATAGAAGGAGCTGGGACACGCGCTTGGGTTGAC 

TGGCTTCTGGTTTTGGTTCTCTGGCTTCTAGTGCTGGAAGAAGCCCTCTCTTTCCCTTCT 

CTTTCCTCAGTAGCATCTGACTCTTTTCATAAGCAAACAGCTGTATAAACAAAGCCCCCA 

TTTTGGTCAAGCACAGGGTGAATGTGATATTGTTCCCACT^CCTTAT^ 

GCCGCCTGGCTTTGGGGAAGAGGCCGCCTTCAGGTGACAGTGCAGCTGTCCAGGTGGCCG 

TGCACTGAACCAGGCTGAGGGAGACAAAAACCCCGCAGACCCGCCTGCCTTTCAGCGTCC 

AGTTAACTGCAGAAGTTTAGGCTCACCTCAAAGATGTCTAGTTTTTCCT^GTTACA^ 

AGCAGTTTCCTACAGAACACCCCCTTCCTCAATTGCCAAGGGGCCGCATCGCACGGCATC 

AGGCCACCACTGCAGGCCAGCAGATTCCT^CCCCAGGAACGGTCT^TGAACTCAG^ 

CTCAACGAGGGGCGTAACATTTCCTTACAGTCAAGCCCCATCAACTAGAAGTGCTTATTA 

CTTTTAGGATTAAAAAAGTAATAACAGACTTTGACTTAATACTCTGTCTTTTCAGAGGCA 

AAGTGGGTGGGTAGAGGGGAGCTTTAAAAATAGAAGTACAAAACAACATCCTGGAAACAT 

ATGACCCCAGATGGAATAATGTCACATTCCCCAGTGCAGATAATGGGCTGCTGCTGGCTC 

TGTGGTGTCTGTCTGCAGAAGATTTGCTCAGTCAAGGAAATTCAAGTGGTGAGACCTTTC 

CACCATGGGTGGTAAGAGAAACCTGCCTTCACCAAAATCTCTGAAGGGGAAAGAAGTGGA 

GAGAAAGGTTTGCTTCACTTCGGGGACTGCAGTTTGAGAAATAAAAGGGATACAGAGATA 

TCTGCACTTTGTAGAAAGGGCAAGATTATTTGCTTATATCTGAAGGGAGGTGGGTGGTTT 

TGCTGGATGTTTGGTCTGAAAGAGTTACTTTTGATAAAGTTAATCTAATTGTAGTTATAT 

TTTCTGTGTGCTTTTTTTTAATTACTAAGAAAAAAATTGGTGAGTTCAGTAGCTTTGGTA 

TTATGAGTGCAAATCATAATAGCTCCAATGTGAAAAAAAAAATCAAAAGTATAACTTGTC 

ACTTAATGTTAGAAAATTGCCTAAAATGCAGTGTAATAAATAATCTCTGTACCAAATAGT 

AATTTAAATGGGGTAATTTTCTGCAAGGAAAATGTACTGTTTTTATGTTTCCAACCCTCT 

TGA 

SEQ ID NO: 39_AA207220_H 

GCTGTGGCTCCCCGTCCTGGTGCGGGACCTGTGCCCCGCXSCTTCAGCCCTCCCCGCAAGC 

CTATTGATTCCCCTGCCGCCCTTGCTCCACCTCCTGCTCGCCATGGAGTCGCTGGTTTTC 

GCGCGGCGCTCCGGCCCCACTCCCTCGGCCGCAGAGCTAGCCCGGCCGCTGGCGGAAGGG 

CTGATCAAGTCGCCCAAGCCCCTAATGAAGAAGCAGGCGGTGAAGCGGCACCACCACAAG 

CACAACCTGCGGCACCGCTACGAGTTCCTGGAGACCCTGGGCAAAGGCACCTACGGGAAG 

GTGAAGAAGGCGCGGGAGAGCTCGGGGCGCCTGGTGGCCATCAAGTCAATCCGGAAGGAC 

AAAATCAAAGATGAGCAAGATCTGATGCACATACGGAGGGAGATTGAGATCATGTCATCA 

CTCAACCACCCTCACATCATTGCCATCCATGAAGTGTTTGAGAACAGCAGCAAGATCGTG 

ATCGTCATGGAGTATGCCAGCCGGGGCGACCTTTATGACTACATCAGCGAGCGGCAGCAG 

CTCAGTGAGCGCGAAGCTAGGCATTTCTTCCGGCAGATCGTCTCTGCCGTGCACTATTGC 

CATCAGAACAGAGTTGTCCACCGAGATCTCAAGCTGGAGAACATCCTCTTGGATGCCAAT 

GGGAATATCAAGATTGCTGACTTCGGCCTCTCCAACCTCTACCATCAAGGCAAGTTCCTG 

CAGACATTCTGTGGGAGCCCCCTCTATGCCTCGCCAGAGATTGTCAATGGGAAGCCCTAC 

ACAGGCCCAGAGGTGGACAGCTGGTCCCTGGGTGTTCTCCTCTACATCCTGGTGCATGGC 

ACCATGCCCTTTGATGGGCATGACCATAAGATCCTAGTGAAACAGATCAGCAACGGGGCC 

TACCGGGAGCCACCTAAACCCTCTGATTGCCTGNNTGGCCTGATCCGGTGGCTGTTGATG 

GTGAACCCCACCCGCCGGGCCACCCTGGAGGATGTGGCCAGTCACTGGTGGGTCAACTGG 

GGCTACGCCACCCGAGTGGGAGAGCAGGAGGCTCCGCATGAGGGTGGGCACCCTGGCAGT 

GACTCTGCCCGCGCCTCCATGGCTGACTGGCTCCGGCGTTCCTCCCGCCCCCTCCTGGAG 

AATGGGGCCAAGGTGTGCAGCTTCrTCAAGCAGCATGCACCTGGTGGGGGAAGCACCACC 

CCTGGCCTGGAGCGCCAGCATTCGCTCAAGAAGTCCCGCAAGGAGAATGACATGGCCCAG 

TCTCTCCACAGTGACACGGCTGATGACACTGCCCATCGCCCTGGCAAGAGCAACCTCAAG 

CTGCCAAAGGGCATTCTCAAGAAGAAGGTGTCAGCCTCTGCAGAAGGGGTACAGGAGGAC 

CCTCCGGAGCTCAGCCCAATCCCTGCGAGCCCAGGGCAGGCTGCCCCCCTGCTCCCCAAG 

AAGGGCATTCTCAAGAAGCCCCGACAGCGCGAGTCTGGCTACTACTCCTCTCCCGAGCCC 

AGTGAATCTGGGGAGCTCTTGGACGCAGGCGACGTGTTTGTGAGTGGGGATCCCAAGGAG 

CAGAAGCCTCCGCAAGCTTCAGGGCTGCTCCTCCATCGCAAAGGCATCCTCAAACTCAAT 

GGCAAGTTCTCCCAGACAGCCTTGGAGCTCGCGGCCCCCACCACCTTCGGCTCCCTGGAT 

GAACTCGCCCCACCTCGCCCCCTGGCCCGGGCCAGCCGACCCTCAGGGGCTGTGAGCGAG 

GACAGCATCCTGTCCTCTGAGTCCTTTGACCAGCTGGACTTGCCTGAACGGCTCCCAGAG 

CCCCCACTGCGGGGCTGTGTGTCTGTGGACAACCTCACGGGGCTTGAGGAGCCCCCCTCA 

GAGGGCCCTGGAAGCTGCCTGAGGCGCTGGCGGCAGGATCCTTTGGGGGACAGCTGCTTT 

TCCCTGACAGACTGCCAGGAGGTGACAGCGACCTACCGACAGGCACTGAGGGTCTGCTCA 

AAGCTCACCTGAGTGGAGTAGGCATTGCCCCAGCCCGGTCAGGCTCTCAGATGCAGCTGG 

TTGCACCCCXSAGGGGAGATGCCTTCTCCCCCACCTCCCAGGACCTGCATCCCAGCTCAGA 

AGGCTGAGAGGGTTTGCAGTGGAGCCCTGAGCAGGGCTGGATATGGGAAGTAGGCAAATG 

AAATGCGCCAAGGGTTCAGTGTCTGTCTTCAGCCCTGCTGAACGAAGAGGATACTAAAGA 

GAGGGGAACGGGAATGCCCGCGACAGAGTCCACATTGCCTGTTTCTTGTGTACATGGAGG 

GGCCACAGAGA 

SEQ ID NO: 40_AA426580_H, MAK_V_H 

ATGCCGGCGGCGGCGGGGGACGGGCTCCTGGGGGAGCCGGCGGCGCCTGGGGGCGGCGGC 
GGCGCGGAGGACGCGGCCAGGCCCGCGGCGGCCTGCGAGGGAAGTTTCCTGCCTGCCTGG 
GTGAGCGGCGTGCCCCGCGAGCGGCTCCGCGACTTCCAGCACCACAAGCGCGTGGGCAAC 
TACCTCATCGGCAGCAGGAAGCTGGGCGAGGGCTCCTTTGCCAAGGTGCGCGAGGGGCTG 
CACGTGCTGACCGGGGAGAAGGTGGCCATAAAAGTCATTGATAAGAAGAGAGCCAAAAAG 
GACACCTATGTCACCAAAAACCTGCGGCGAGAGGGTCAGATCCAGCAGATGATCCGCCAC 
CCCAATATCACTCAGCTCCTTGATATTTTAGAAACGGAAAACAGCTACTACCTGGTCATG 
GAGCTGTGCCCTGGGGGCAACCTGATGCACAAGATCTATGAGAAGAAGCGGCTGGAGGAG 
TCCGAAGCCCGCAGATACATCCGACAGCTCATCTCTGCCGTAGAGCACCTGCACCGGGCC 

GGGGTGGTCCACAGAGACTTGAAGATAGAGAATTTGCTACTAGATGAAGACAATAATATC 

AAGCTGATTGACTTTGGTTTGAGCAACTGCGCAGGGATCCTGGGTTACTCGGATCCGTTC 

AGCACACAGTGTGGCAGCCCTGCCTACGCTGCACCTGAACTGCTCGCCAGGAAGAAATAC 

GGCCCCAAAATCGATGTCTGGTCCATAGGTGTGAACATGTATGCCATGTTGACCGGGACG 

CTGCCTTTCACGGTGGAGCCTTTCAGCCTGAGGGCTrTGTACCAGAAGATGGTAGACAAA 

GAAATGAACCCCCTCCCCACTCAGCTCTCCACAGGTGCCATCAGTTTCCTGCGCTCTCTC 

CTGGAACCGGATCCTGTGAAGAGGCCAAATATTCAGCAGGCACTGGCGAATCGCTGGCTT 

AATGAGAATTACACGGGCAAAGTGCCCTGTAATGTCACCTATCCCAACAGGATTTCTCTG 

GAAGATCTGAGCCCGAGCGTCGTGCTGCACATGACCGAGAAGCTGGGTTACAAGAACAGC 

GACGTGATCT^CACTGTGCTCrCCT^CCGCGCCTGCCACATCCTGGCCATCTA 

TTAAACAAGAAACTGGAGCGCTATTTGTCAGGGAAATCTGACATCCAGGACAGCCTCTGC 

TACAAGACCCGGCTCTACCAGATAGAAAAGTACAGGGCCCCCAAGGAGTCCTATGAGGCC 

TCTCTGGACACCTGGACACGAGATCTTGAATTCCATGCCGTGCAGGATAAAAAGCCCAAA 

GAACAAGAAAAAAGAGGGGATTTTCTTCATCGACCATTCTCC7UVGAAGTTGGACAAGAAC 

CTGCCCTCGCACAAACAGCCCTCAGGCTCGCTTATGACACAGATTCAGAACACCAAAGCC 

CTCCTGAAGGACCGGAAGGCCTCCAAGTCCAGCTTCCCCGACAAAGATTCCTTTGGCTGC 

CGCAATATTTTCCGCAAAACCTCAGATTCCAATTGTGTGGCTTCTTCTTCCATGGAGTTC 

ATCCCCGTGCCACCGCCCAGGACCCCGAGGATTGTGAAGAAACCGGAGCCCCATCAGCCA 

GGGCCCGGAAGCACTGGCATCCCCCACAAGGAAGACCCCCTGATGCTGGACATGGTGCGC 

TCCTTCGAGTCTGTGGATCGCGACGACCACGTAGAAGTGCTGTCTCCCTCTCATCACTAC 

AGGATTCTGAACTCCCCGGTCAGCTTGGCTCGCAGAAATTCCAGCGAGAGGACGCTGTCC 

CCGGGTCTGCCATCCGGAAGCATGTCGCCTCTCCATACTCCTTTGCATCCAACTCTGGTC 

TCTTTTGCTCACGAAGATAAGAACAGCCCCCCAAAAGAGGAGGGCCTGTGTTGCCCACCT 

CCGGTTCCCAGCAATGGCCCCATGCAGCCTCTGGGGAGCCCCAATTGTGTGAAAAGCCGA 

GGCCGGTTCCCTATGATGGGCATCGGACAGATGTTAAGGAAGCGCCATCAGAGTCTGCAG 

CCATCTGCAGATAGGCCCCTGGAGGCCAGCCTGCCCCCACTGCAGCCCCTAGCCCCTGTG 

AACCTTGCCTTTGACATGGCCGATGGGGTCAAGACCCAGTGCTAA 

SEQ ID NO: 41_Z36720_H 

ATGGACACAAAGCTGAACATGCTGAACGAGAAGGTGGACCAGCTCCTGCACTTCC^^ 

GATGTCACAGAGAAGTTGCAGAGCATGTGCCGAGACATGGGCCACCTGGAGCGGGGCCTG 

CACAGGCTGGAGGCCTCCCGGGCACCGGGCCCGGGCGGGGCTGATGGGGTTCCCCACATT 

GACACCCAGGCTGGGTGGCCCGAGGTCCTGGAGCTGGTGAGGGCCATGCAGCAGGATGCG 

GCCCAGCACGGTGCCAGGCTGGAGGCCCTCTTCAGGATGGTGGCTGCGGTGGACAGGGCC 

ATCGCTTTGGTGGGGGCCACGTTCCAGAAATCAAAGGTGGCGGATTTCCTCATGCAGGGG 

CGTGTGCCCTGGAGGAGAGGCAGCCCAGGTGACAGCCCTGAGGAGTGGGTAAAAGAGGAG 

GAGGTCTGTTTCATGCCTCCAGTTCCCCCAGCTCCGGGGGCAGCAGGACAGAGCCTGCAG 

AAGGATAAGGGGGAGCTGTCTGCCGAGCAGGGGATCTGGGCCACATTGATGACGCTGGTG 

ATCATGGTGACAGCGGCAAATAAAGAGCGAGTGGAAGAAGAGGGAGGAAAACCAAAGCAT 

GTGCTGAGCACCAGTGGGGTGCAGTCTGATGCCAGGGAGCCTGGGGAAGAGAGCCAGAAG 

GCGGACGTGCTGGAGGGGACAGCGGAGAGGCTGCCCCCCATCAGAGCGTCAGGGCTGGGA 

GCTGACCCCGCCCAGGCAGTGGTCTCACCGGGCCAGGGAGATGGTGTTCCTGGCCCAGCC 

CAGGCATTCCCTGGCCACCTGCCCCTGCCCACAAAGGTGGAAGCCAAGGCTCCTGAGACA 

CCCAGCGAGAACCTCAGGACTGGCCTGGAATTGGCTCCAGCACCCGGCAGGGTCAATGTG 

GTCTCCCCGAGCCTGGAGGTTGCACCAGGtGCAGGACTU^GGAGCATCGTCCAGCAOT 

GACCCTGAGCCCTTAGAGGAAGGCACGAGGCTGACTCCAGGGCCTGGCCCTCAGTGCCCA 

GGGCCTCCAGGGCTGCCAGCCCAGGCCAGGGCAACCCACAGTGGTGGAGAAACACCTC 

AGGGCAGCCCTGCTGAAGGGCGCTGTGGCCCCGGGCTTCTCTCGGAGGGACCTGGTGTTT 

CCTAGCATCTTCTGCGCCTGCCTAGGGATCTCCATCCACATACAAGAGATGGATACTCCT 
GGGGAGATGCTGATGACAGGCAGGGGCAGCCTTGGACCCACCCTCACCACAGAGGCTCCA 
GCAGCTGCCCTIGGCAGGCAAGCAGGGCCCACCTGGGACCGGGCGCTGCCTCCAAGCCCCT 
GGGACTGAGCCCGGAGAACAGACCCCTGAAGGAGCCAGAGAGCTCTCCCCGCTGCAGGAG 
AGCAGCAGCCCCGGGGGAGTGAAGGCAGAGGAGGAGCAAAGGGCTGGGGCCGAGCCTGGC 
ACGAGACCAAGCTTGGCCAGGAGTGACGACAATGACCACGAGGTTGGGGCCCTGGGCCTG 
CAGCAGGGCAAAAGCCCAGGGGCGGGAAACCCTGAGCCTGAGCAGGACTGTGCAGCCAGG 
GCTCCGGTGAGAGCTGAAGCAGTAAGGAGGATGCCCCCAGGCGCCGAGGCTGGCAGCGTG 
GTTCTGGATGACAGTCGGGCCCCACCAGCTCCTTTTGAACACCGGGTAGTGAGCGTCAAG 
GAGACCTCCATCTCTGCGGGTTACGAGGTGTGCCAGCACGAAGTCTTGGGAGGGGGTCGG 
TTTGGCCAGGTCCACAGGTGCACAGAGAAGTCCACAGGCCTCCCACTGGCTGCCAAGATC 
ATCAAAGTGAAGAGCGCCAAGGACCGGGAGGACGTGAAGAACGAGATCAACATCATGAAC 
CAGCTCAGCCACGTGAACCTGATCCAGCTCTATGACGCCTTCGAGAGCAAGCACAGCTGC 
ACCCTTGTCATGGAGTACGTGGACGGGGGTGAGCTCTTCGACCGGATCACAGATGAGAAG 
TACCACCTGACTGAGCTGGATGTGGTCCTGTTCACCAGGCAGATCTGTGAGGGTGTGCAT 
TACCTGCACCAGCACTACATCCTGCACCTGGACCTCAAGCCGGAGAACATATTGTGCGTC 
AATCAGACAGGACATCAAATTAAGATCATTGACTTTGGGCTGGCCAG7VAGGTACAAGCCT 
CGAGAGAAGCTGAAGGTGAACTTCGGCACTCCTGAGTTCCTGGCCCCAGAAGTCGTCAAT 
TATGAGTTTGTCTCATTCCCCACAGACATGTGGAGTGTGGGAGTCATCACCTACATGCTA 
C'TCAGTGGCTTGTCCCCATTTCTAGGGGAAACAGATGCAGAGACCATGAATTTCATTGTA 
AACTGTAGCTGGGATTTTGATGCTGACACCTTTGAAGGGCTCTCGGAGGAGGCCAAGGAC 
TTTGTTTCCCGGTTGCTGGTCAAAGAGAAGAGCTGCAGAATGAGTGCCACACAGTGCCTG 
AAACACGAGTGGCTGAATAATTTGCCTGCCAAAGCTTCAAGATCCAAAACTCGTCTCAAA 
TCCCAACTACTGCTGCAGAAATACATAGCTCAAAGAAAATGGAAGAAACATTTCTATGTG 
GTGACTGCTGCCAACAGGTTAAGGAAATTTCCAACTTCTCCCTT^ 

SEQ ID NO: 42_SGK088_H 

GGGGAGATGGCGCTGTTTGAGTGCCTGGTGGCGGGGCCCACTGACGTGGAGGTGGATTGG 
CTGTGCCGTGGCCGCCTGCTGCAGCCTGCACTGCTCAAATGCAAGATGCATTTCGATGGC 
CGCAAATGCAAGCTGCTACTTACATCTGTACATGAGGACGACAGTGGCGTCTACACCTGC 
AAGCTCAGCACGGCCAAAGATGAGCTGACCTGCAGTGCCCGGCTGACCGTGCGGCCCTCG 
TTGGCACCCCTGTTCACACGGCTGCTGGAAGATGTGGAGGTGTTGGAGGGCCGAGCTGCC 
CGTTTCGACTGCAAGATCAGTGGCACCCCGCCCCCTGTTGTTACCTGGACTCATTTTGGC 
TGCCCCATGGAGGAGAGTGAGAACTTGCGGCTGCGGCAGGACGGGGGTCTGCACTCACTG 
CACATTGCCCATGTGGGCAGCGAGGACGAGGGGCTCTATGCGGTCAGTGCTGTTAACACC 
CATGGCCAGGCCCACTGCTCAGCCCAGCTGTATGTAGAAGAGCCCCGGACAGCCGCCTCA 
GGCCCCAGCTCGAAGCTGGAGAAGATGGCATCCATTCCCGAGGAGCCAGAGCAGGGTGAG 
CTGGAGCGGCTGTCCATTCCCGACTTCCTGCGGCCACTGCAGGACCTGGAGGTGGGACTG 
GCCAAGGAGGCCATGCTAGAGTGCCAGGTGACCGGCCTGCCCTACCCCACCATCAGCTGG 
TTCCACAATGGCCACCGCATCCAGAGCAGCGACGACCGGCGCATGACACAGTACAGGGAT 
GTCCATCGCTTGGTGTTCCCTGCCGTGGGGCCTCAGCACGCCGGTGTCTACAAGAGCGTC 
ATTGCCAACAAGCTGGGCAAAGCTGCCTGCTATGCCCACCTGTATGTCACAGATGTGGTC 
CCAGGCCGTCCAGATGGCGCCCCGCAGGTGGTGGCTGTGACGGGGAGGATGGTCACACTC 
ACATGGAACCCCCCCAGGAGTCTGGACATGGCCATCGACCCGGACTCCCTGACGTACACA 
GTGCAGCACCAGGTGCTGGGCTCGGACCAGTGGACGGCAGTGGTCACAGGCCTGCGGGAG 
CCAGGGTGGGCAGCCACAGGGCTGCGTAAGGGGGTCCAGCACATCTTCCGGGTCCTCAGC 
ACCACTGTCAAGAGCAGCAGCAAGCCCTCACCCCCTTCTGAGCCTGTGCAGCTGCTGGAG 
CACGGCCCAACCCTGGAGGAGGCCCCTGCCATGCTGGACA/^CCAGACATCGTGTATGTG 
GTGGAGGGACAGCCTGCCAGCGTCACCGTCACATTCAACCATGTGGAGGCCCAGGTCGTC 
TGGAGGAGCTGCCGAGGGGCCCTCCTAGAGGCACGGGCCGGTGTGTACGAGCTGAGCCAG 
CCAGATGATGACCAGTACTGTCTTCGGATCTGCCGGGTGAGCCGCCGGGACATGGGGGCC 
CTCACCTGCACCGCCCGAAACCGTCACG6CACACAGACCTGCTCGGTCACATTGGAGCTG

GCAGAGGCCCCTCGGTTTGAGTCCATCATGGAGGACGTGGAGGTGGGGGCTGGGGAAACT 

GCTCGCTTTGCGGTGGTGGTCGAGGGAAAACCACTGCCGGACATCATGTGGTACAAGGAC 

GAGGTGCTGCTGACCGAGAGCAGCCATGTGAGCTTCGTGTACGAGGAGAATGAGTGCTCC 

CTGGTGGTGCTCAGCACGGGGGCCCAGGATGGAGGCGTCTACACCTGCACCGCCCAGAAC 

CTGGCGGGTGAGGTCTCCTGCAAAGCAGAGTTGGCTGTGCATTCAGCTCAGACAGCTATG 

GAGGTCGAGGGGGTCGGGGAGGATGAGGACCATCGAGGAAGGAGACTCAGCGACTTTTAT 

GACATCCACCAGGAGATCGGCAGGGGTGCTTTCTCCTACTTGCGGCGCATAGTGGAGCGT 

AGCTCCGGCCTGGAGTTTGCGGCCAAGTTCATCCCCAGCCAGGCCAAGCCAAAGGCATCA 

GCGCGTCGGGAGGCCCGGCTGCTGGCCAGGCTCCAGCACGACTGTGTCCTCTACTTCCAT 

GAGGCCTTCGAGAGGCGCCGGGGACTGGTCATTGTCACCGAGCTCTGCACAGAGGAGCTG 

CTGGAGCGAATCGCCAGGAAACCCACCGTGTGTGAGTCTGAGATCCGGGCCTATATGCGG 

CAGGTGCTAGAGGGAATACACTACCTGCACCAGAGCCACGTGCTGCACCTCGATGTCAAG 

CCTGAGAACCTGCTGGTGTGGGATGGTGCTGCGGGCGAGCAGCAGGTGCGGATCTGTGAC 

TTTGGGAATGCCCAGGAGCTGACTCCAGGAGAGCCCCAGTACTGCCAGTATGGCACACCT 

GAGTTTGTAGCACCCGAGATTGfCAATCAGAGCCCCGTGTCTGGAGTCACTGACATCTGG 

CCTGTGGGTGTTGTTGCCTTCCTCTGTCTGACAGGAATCTCeCCGTTTGTTGGGGAAAAT 

GACCGGACAACATTGATGAACATCCGAAACTACAACGTGGCGTTCGAGGAGACCACATTC 

CTGAGCCTGAGCAGGGAGGCCCGGGGCTTCCTCATCAAAGTGTTGGTGCAGGACCGGCTG 

AGACCTACCGCAGAAGAGACCCTAGAACATCCTTGGTTCAAAACTCAGGCAAAGGGCGCA 

GAGGTGAGCACGGATCACCTGAAGCTATTCCTCTCCCGGCGGAGGTGGCAGCGCTCCCAG 

ATCAGCTACAAATGCCACCTGGTGCTGCGCCCCATCCCCGAGCTGCTGCGGGCCCCCCCA 

GAGCGGGTGTGGGTGACCATGCCCAGAAGGCCACCCCCCAGTGGGGGGCTCTCATCCTCC 

TCGGATTCTGAAGAGGAAGAGCTGGAAGAGCTGCCCTCAGTGCCCCGCCCACTGCAGCCC 

GAGTTCTCTGGCTCCCGGGTGTCCCTCACAGACATTCCCACTGAGGATGAGGCCCTGGGG 

ACCCCAGAGACTGGGGCTGCCACCCCCATGGACTGGCAGGAGCAGGGAAGGGCTCCCTCT 

CAGGACCAGGAGGCTCCCAGCCCAGAGGCCCTCCCCTCCCCAGGCCAGGAGCCCGCAGCT 

GGGGCTAGCCCCAGGCGGGGAGAGCTCCGCAGGGGCAGCTCGGCTGAGAGCGCCCTGCCC 

CGGGCCGGGCCGCGGGAGCTGGGCCGGGGCCTGCACAAGGCGGCGTCTGTGGAGCTGCCG 

CAGCGCCGGAGCCCCGGCCCGGGAGCCACCCGCCTGGCCCGGGGAGGCCTGGGTGAGGGC 

GAGTATGCCCAGAGGCTGCAGGCCCTGCGCCAGCGGCTGCTGCGGGGAGGCCCCGAGGAT 

GGCAAGGTCAGCGGCCTCAGGGGTCCCCTGCTGGAGAGCCTGGGGGGCCGTGCTCGGGAC 

CCCCGGATGGCACGAGCTGCCTCCAGCGAGGCAGCGCCCCACCACCAGCCCCCACTCGAG 

AACCGGGGCCTGCAAAAGAGCAGCAGCTTCTCCCAGGGTGAGGCGGAGCCCCGGGGCCGG 

CACCGCCGAGCGGGGGCGCCCCTCGAGATCCCCGTGGCCAGGCTTGGGGCCCGTAGGCTA 

CAGGAGTCTCCTTCCCTGTCTGCCCTCAGCGAGGCCCAGCCATCCAGCCCTGCACGGCCC 

AGCGCCCCCAAACCCAGTACCCCTAAGTCTGCAGAACCTTCTGCCACCACACCTAGTGAT 

GCTCCGCAGCCCCCCGCACCCCAGCCTGCCCAAGACAAGGCTCCAGAGCCCAGGCCAGAA 

CCAGTCCGAGCCTCCAAGCCTGCACCACCCCCCCAGGCCCTGCAAACCCTAGCGCTGCCC 

CTCACACCCTATGCTCAGATCATTCAGTCCCTCCAGCTGTCAGGCCACGCCCAGGGCCCC 

TCGCAGGGCCCTGCCGCGCCGCCTTCAGAGCCCAAGCCCCACGCTGCTGTCTTTGCCAGG 

GTGGCCTCCCCACCTCCGGGAGCCCCCGAGAAGCGCGTGCCCTCAGCCGGGGGTCCCCCG 

GTGCTAGCCGAGAAAGCCCGAGTTCCCACGGTGCCCCCCAGGCCAGGCAGCAGTCTCAGT 

AGCAGCATCGAAAACTTGGAGTCGGAGGCCGTGTTCGAGGCCAAGTTCAAGCGCAGCCGC 

GAGTCGCCCCTGTCGCTGGGGCTGCGGCTGCTGAGCCGTTCGCGCTCGGAGGAGCGCGGC 

CCCTTCCGTGGGGCCGAGGAGGAGGATGGCATATACCGGCCCAGCCCGGCGGGGACCCCG 

CTGGAGCTGGTGCGACGGCCTGAGCGCTCACGCTCGGTGCAGGACCTCAGGGCTGTCGGA 

GAGCCTGGCCTCGTCCGCCGCCTCTCGCTGTCACTGTCCCAGCGGCTGCGGCGGACCCCT 

CCCGCGCAGCGCCACCCGGCCTGGGAGGCCCGCGGCGGGGACGGAGAGAGCTCGGAGGGC 

GGGAGCTCGGCGCGGGGCTCCCCGGTGCTGGCGATGCGCAGGCGGCTGAGCTTCACCCTG 
GAGCGGCTGTCCAGCCGATTGCAGCGCAGTGGCAGCAGCGAGGACTCGGGGGGCGCGTCG
GGCCGCAGCACGCCGCTGTTCGGACGGCTTCGCAGGGCCACGTCCGAGGGCGAGAGTCTG 
CGGCGCCTTGGCCTTCCGCACAACCAGTTGGCCGCCCAGGCCGGCGCCACCACGCCTTCC 
GCCGAGTCCCTGGGCTCCGAGGCCAGCGCCACGTCGGGCTCCTCAGCCCCAGGGGAAAGC 
CGAAGCCGGCTCCGCTGGGGCTTCTCTCGGCCGCGGAAGGACAAGGGGTTATCGCCACCA 
AACCTCTCTGCCAGCGTCCAGGAGGAGTTGGGTCACCAGTACGTGCGCAGTGAGTCAGAC 
TTCCCCCCAGTCTTCCACATCAAACTCAAGGACCAGGTGCTGCTGGAGGGGGAGGCAGCC 
ACCCTGCTCTGCCTGCCAGCGGCCTGCCCTGCACCGCACATCTCCTGGATGAAAGACAAG 
AAGTCCTTGAGGTCAGAGCCCTCAGTGATCATCGTGTCCTGCAAAGATGGGCGGCAGCTG 
CTCAGCATCCCCCGGGCGGGCAAGCGGCACGCCGGTCTCTATGAGTGCTCGGCCACCAAC 
GTACTGGGCAGCATCACCAGCTCCTGTACCGTGGCTGTGGCCCGAGTCCCAGGAAAGCTA 
GCTCCTCCAGAGGTAACCCAGACCTACCAGGACACGGCGCTGGTGCTGTGGAAGCCGGGA 
GACAGCCGGGCACCTTGCACGTATACGCTGGAGCGGCGAGTGGATGGGGAGTCTGTGTGG 
CACCCTGTGAGCTCAGGCATCCCCGACTGTTACTACAACGTGACCCACCTGCCAGTTGGC 
GTGACTGTGAGGTTCCGTGTGGCCTGTGCCAACCGTGCTGGGCAGGGGCCCTTCAGCAAC 
TCTTCTGAGAAGGTCTTTGTCAGGGGTACTCAAGATTCTTCAGCTGTGCCATCTGCTGCC 
CACCAAGAGGCCCCTGTCACCTCAAGGCCAGCCAGGGCCCGGCCTCCTGACTCTCCTACC 
TCACTGGCCCCACCCCTAGCTCCTGCTGCCCCCACACCCCCGTCAGTCACTGTCAGCCCC 
TCATCTCCCCCCACACCTCCTAGCCAGGCCTTGTCCTCGCTCAAGGCTGTGGGTCCACCA 
CCCCAAACCCCTCCACGAAGACACAGGGGCCTGCAGGCTGCCCGGCCAGCGGAGCCCACC 
CTACCCAGTACCCACGTCACCCCAAGTGAGCCCAAGCCTTTCGTCCTTGACACTGGGACC 
CCGATCCCAGCCTCCACTCCTCAAGGGGTTAAACCAGTGTCTTCCTCTACTCCTGTGTAT 
GTGGTGACTTCCTTTGTGTCTGCACCACCAGCCCCTGAGCCCCCAGCCCCTGAGCCCCCT 
CCTGAGCCTACCAAGGTGACTGTGCAGAGCCTCAGCCCGGCCAAGGAGGTGGTCAGCTCC 
CCTGGGAGCAGTCCCCGAAGCTCTCCCAGGCCTGAGGGTACCACTCTTCGACAGGGTCCC 
CCTCAGAAACCCTACACCTTCCTGGAGGAGAAAGCCAGGGGCCGCTTTGGTGTTGTGCGA 
GCGTGCCGGGAGAATGCCACGGGGCGAACGTTCGTGGCCAAGATCGTGCCCTATGCrrGCC 
GAGGGCAAGCCGCGGGTCCTGCAGGAGTACGAGGTGCTGCGGACCCTGCACCACGAGCGG 
ATCATGTCCCTGCACGAGGCCTACATCACCCCTCGGTACCTCGTGCTCATTGCTGAGAGC 
TGTGGCAACCGGGAACrCCTCTGTGGGCTCAGTGACAGGTTCCGGTATTCTGAGGATGAC 
GTGGCCACTTACATGGTGCAGCTGCTACAAGGCCTGGACTACCTCCACGGCCACCACGTG 
CTCCACCTAGACATCAAGCCAGACAACCTGCTGCTGGCCCCTGACAATGCCCTCAAGATT 
GTGGACTTTGGCAGTGCCCAGCCCTACAACCCCCAGGCCCTTAGGCCCCTTGGCCACCGC 
ACGGGCACGCTGGAGTTCATGGCTCCGGAGATGGTGAAGGGAGAACCCATCGGCTCTGCC 
ACGGACATCTGGGGAGCGGGTGTGCTCACTTACATTATGCTCAGTGGACGCTCCCCGTTC 
TATGAGCCAGACCCCCAGGAAACGGAGGCTCGGATTGTGGGGGGCCGCTTTGATGCCTTC 
CAGCTGTACCCCAATACATCCCAGAGCGCCT^CCCTCTTCTTGCGAAAGGTTCTCTCTGTA 
CATCCCTGGAGCCGGCCCTCCCTGCAGGACTGCCTGGCCCACCCATGGTTGCAGGACGCC 

TACCTGATGAAGCTGCGCCGCCAGACGCTCACCTTCACCACCAACCGGCTCAAGGAGTTC

CTGGGCGAGCAGCGGCGGCGCCGGGCTGAGGCTGCCACCCGCCACAAGGTGCTGCTGCGC 

TCCTACCCTGGCGGCCCCTAGAGGCACGGACCACAGCCAGGCCTCGGGCTTCAACTGGGG

ttcccacct^tgccacgggactvttccagggcccacgctgagccaggcgggcctgg^ 

CGGTTACCACCAGCAGCAACATCTGGCTGGGCTCTTACCTCATAGACCTTCAAGGACAGA

GACCCCAGGGCCTGGACCTGATGCCACCCCAGGCCAAAGCCAGAGTGGGAGACCCATTGG 
TCAGGCTCAGCAGGGTGGGAACAGGCAGAGGGACTUVGAGGGGAATGGAGAAGT^ 

aaaaggaatcgagggacaggaagggggaggctctaggaaggttctgggttgggggtct^gt 
GCATCTCAGGGAGAACCAAGGAAGGTGGGCATGGCTGGAGAGGAGGAAAAGGAAGGAGCC
CCAGGTGTCAGGGCAGTAGGCTGGGAGTCAGTGTGGCAAAGCGGGGGCAGGACACAGATA
CAGTGGCAGGGGCCCAGGGCTGGGACATGAGAGAAGGCAGCGAGGCGGCAGAGGGAGAAG
AGAGGACTCAGGTGGAGGTGGGGTGGGTCAGCTGTCAGCATCCCTCAGAGGAGAAATGTG
GAGAGCTGGAGGCCAGCAGTCACTCACACTCGCTCTGTCCTCCTGTCCAGTGGATACAGC
CCTGGGCGCTCTGCTGGCCCAAGGATGTCCCCACTGCCCCTCCATGGCCTTTGGCCTTCT 
TCCCATTCATATTTATTTATTTATTGACTTTTATGAAGTTTCCCCTTCCATCCGATCCCT 
ACTGCCCATGTTGTCCTGACCATCCCTCCCAGCCATCCAGCTGTCTGTCTGTCTGCCACA 
AGGAAATAAAAATGGCAAGCAGCAAAAAAA 

SEQ ID NO: 43_AA542015_M SGK088_M 

GCCACGGACATCTGGGGAGCGGGTGTGCTCACCTACATCATGCTTAGTGGGTACTCCCCA 

TTCTATGAGCCAGACCCCCAGGAAACAGAGGCTCGGATTGTTGGGGGTCGCTTTGATGCC 

TTCCAGTTGTATCCTAACACATCCCAGAGTGCCACCCTCTTCTTGAGAAAGGTCCTCTCA 

GTACATCCCTGGAGCCGGCCCTCTCTGCAGGACTGCTTGGCCCACCCATGGCTGCAAGAT 

GCCTACCTGATGAAGCTGCGCCGCCAGACACTCACCTTCACCACCAACCGGCTCAAGGAA 

TTCCTGGGCGAGCAGCGGCGACGTCGGGCTGAGGCTGCTACCCGTCACAAGGTGCTGCTC 

CGCTCCTACCCTGGCAGCCCCTAGGTGGCACAGACCGCAGCCCGGCCACGGGCTTCAACT 

TGGGTTCTCACTCGCGCTGCCAAGGGACATTCCAGAGCCCATGCTGAGCTGGACAGGCAG 

GGGCTTCAGATACCAGCAGCAGCAGCAGCAGCAGCAGCAGCAACATCTGGCTGGGCTATT 

ACCTCATGGACCTAAGAGGACAAGGCCCTGGGGCTTCT^GCCGAATGTCACCCCGGCCATA 

ACCT^GAGCAGGAGACCCACTGGCCAGGCTGGGCAAGGGTGAGAGCAGAAAGAGGC^^ 

GGGAGTTGGGAAGTGAAGAATGAGACGGAGGATAGAGAGGGAGGAGTTTGAGGAAGGTTC 

TAGGCTGGAGTGGAATGCTATATCTCAGGGAGAAGCCAGAAGGGGACATGGCTGAAGAGG 

AAGAAGGACCCTGTGATGTGGGAATGTGGTGGAGAGGAGGACTGGACATAGAGAGTGTGC 

CAGGAGCCAGAGCAGAGACATAAGGGAGGGCAGAAGGGTAGAAGGCTVACAGGAGTG^ 

CAGGGGTGGCAGGGaVGGCCAGCA.GCTGCATCTTCAGAAAGAGAGAGGAGAAAGGC^^ 

AGACGAAAGGCCGCTCCAGCTGGTCTCCTGTCCCAGCCGATGCAGTTCTGGGCGTTCTCC 

ACTGGCCCAGGGATGTCCTCACTGCTCCTCCATGGCCTTTGCCCTCCTTCCCATTTGTAT 

TTATTTATTTATTGCCTTTTGTGGAGTTTCCTTTCTATCCAGTCCCTAGTGCCTATGTTG 

TCCCGACCATCCCCCTTCAGTCACCCAGCTGTCTGTGCAGCTGTCTGTCTGTCTGTCACA 

AGGAAAATAAAACAAAACAAAACAAAACAAAACAAACAAA 

SEQ ID NO: 44_R19772_H 

ATGAAGGGCGGCGACAGGGCTTACACCCGAGGTCCCTCTTTGGGGTGGCTCTTTGCTAAG 
TGCTGCTGTTGCTTCCCGTGTAGAGATGCATACTCTCATTCCTCAAGCGAGAATGGAGGC 
AAGTCCGAGTCCGTAGCCAACCTGCAGGCCCAGCCCTCCCTGAACTTCATCCACAGTTCC 
CCGGGTCCCAAGCGCTCCACCAACACTCTTAAGAAGTGGCTGACGAGTCCTGTGCGTCGG 
CTCAACAGCGGGAAGGCAGATGGAAACATCAAAAAGCAGAAGAAAGTTCGCGATGGTCGG 
AAGAGCTTTGACCTGGGATCTCCCAAGCCTGGGGATGAAACAACCCCTCAGGGAGACAGC 
GCTGATGAGAGCAAGAAAGGTTGGGGTGAAGATGAGCCGGATGAAGAGTCACACACACCC 
CTCCCACCACCTATGAAGATTTTTGACAACGACCCTACACAGGATGAAATGTCCTCCTCT 
TTGCTAGCAGCCCGGCAGGCTTCCACTGAAGTACCTACTGCTGCAGACCTTGTCAATGCA 
ATAGAAAAGTTGGTCAAAAACAAGCTGAGTCTAGAAGGAAGCTCATACCGGGGGAGCTTG 
AAAGACCCTGCAGGCTGCCTGAATGAGGGGATGGCCCCACCCACACCTCCTAAAAATCCA 
GAAGAAGAACAGAAAGCCAAGGCCCTGAGAGGCAGGATGTTTGTCCTGAATGAGCTGGTA 
CAGACAGAGAAAGACTATGTCAAGGATCTGGGCATTGTGGTGGAGGGCTTCATGAAGAGA 
ATAGAAGAAAAGGGTGTCCCTGAGGATATGCGAGGAAAGGACAAAATCGTGTTTGGAAAT 
ATTCATCAGATTTATGACTGGCATAAGGATTTTTTCCTGGCGGAACTGGAAAAGTGTATC 
CAGGAGCAAGACAGATTGGCACAGCTCTTTATTAAGCACGAGCGGAAGCTGCACATCTAC 
GTGTGGTATTGTCAGAATAAGCCGCGCTCAGAGTACATCGTTGCTGAGTATGACGCCTAC 
TTTGAGGAGGTAAAACAGGAGATAAATCAGAGGCTGACACTGAGTGACTTCCTCATCAAG 
CCCATTCAGAGAATAACAAAATACCAGTTGCTCCTCAAGGACTTCCTGAGATACAGTGAG 
AAGGCTGGTTTGGAGTGTTCAGATATCGAGAAAGCAGTGGAGTTAATGTGCCTTGTTCCC 
AAACGCTGCAATGACATGATGAATCTAGGACGTCTGCAGGGCTTTGAGGGCACTCTGACT

GCTCAGGGGAAGCTACTGCAGCAGGACACATTCTATGTGATCGAGCTGGATGCAGGCATG 

CAGTCCCGGACCAAAGAGAGGCGCGTGTTCCTCTTCGAGCAGATTGTCATCTTCAGTGAA 

CTGCTCAGGAAGGGATCCCTCACCCCTGGCTACATGTTCAAAAGGAGCATCAAGATGAAT 

TACTTGGTCCTGGAGGAGAATGTGGACAATGATCCCTGCAAGTTTGCACTCATGAACAGA 

GAGACTTCTGAGAGGGTTGTTCTGCAAGCCGCCAACGCTGACATCCAGCAGGCCTGGGTG 

CAGGACATCAATCT^GTCTTAGAAACACAGCGAGACTTTTTGAATGCACTGC^ 

ATTGAGTATCAACGGAAAGAAAGGAGCACAGCTGTGATGAGGTCTCAACCTGCCAGGCTT 

CCCCAAGCCAGCCCCAGGCCCTACTCCTCTGTTCCTGCGGGCTCAGAGAAGCCCCCAAAG 

GGCTCCAGCTATAACCCACCTCTGCCTCCCCTGAAGATATCTACCTCCAATGGCAGTCCA 

GGGTTTGAATACCACCAGCCTGGGGACAAGTTCGAAGCCAGCAAGAACGACCTGGGAGGC 

TGCAATGGGACCTCGTCCATGGCCGTGATCAAAGATTACTATGCACTGAAGGAGAATGAA 

ATCTGTGTGAGCCAAGGTGAGGTGGTCCAGGTCCTCGCCGTCAACCAGCAGAACATGTGT 

CTGGTGTACCAGCCTGCCAGCGACCATTCCCCCGCCGCCGAGGGCTGGGTCCCAGGCAGC 

ATCCTGGCGCCCCTCACCAAAGCCACAGCAGCAGAAAGTAGTGACGGGAGCATCAAGAAG 

TCATGTTCATGGCATACTCTACGCATGAGAAAGCGGGCGGAAGTGGAGAACACGGGTAAA 

AATGAAGCCACAGGGCCTCGTAAACCCAAGGATATTCTGGGCAACAAAGTCTCTGTTAAA 

GAGACGAACAGTTCCGAGGAATCAGAGTGTGATGATCTTGACCCTAATACTAGCATGGAG 

ATCTTAAATCCAAATTTCATCCAAGAAGTGGCCCCAGAATTCCTTGTGCCCTTGGTGGAT 

GTGACCTGCTTGCTTGGGGACACAGTGATACTGCAGTGCAAAGTCTGTGGGCGGCCAAAG 

CCCACCATCACTTGGAAGGGTCCAGACCAGAACATCCTTGACACTGATAACAGCTCAGCC 

ACATACACGGTCTCCTCTTGTGATTCTGGAGAAATCACCCTGAAGATCTGTAATCTGATG 

CCCCAAGACAGTGGGATTTATACCTGCATAGCAACAAATGACCACGGGACCACATCT^ 

TCTGCAACAGTCAAAGTGCAAGGTGTTCCAGCAGCCCCTAACCGCCCCATTGCCC^ 

AGAAGCTGCACCTCCGTGATTCTCCGCTGGCTGCCCCCCTCCAGCACAGGAAACTGCACT 

ATTTCTGGTTACACTGTGGAGTACAGAGAGGAAGGTTCTCAGATCTGGCAGCAGTCAGTG 

GCTTCGACCTTGGACACTTACCTCGTCATCGAAGACCTTAGTCCCGGGTGTCCTTATCAG 

TTCAGAGTCAGTGCCAGTAACCCCTGGGGAATCAGCCTTCCCAGCGAGCCCTCGGAGTTT 

GTGCGACTTCCAGAATACGATGCTGCTGCTGATGGTGCCACCATTTCTTGGAAGGAAAAT 

TTTGACTCAGCTTACACTGAGCTGAATGAAATTGGAAGAGGCCGTTTCTCTATAGTAAAG 

AAATGCATTCACAAAGCTACCCGCAAAGATGTGGCTGTGAAATTTGTTAACAAAAAAATG 

AAGAAGAAAGAACAGGCTGCCC7VCGAGGCTGCCCTGCTTCAGCACCTACAGCACCCCCAG 

TACATCACTCTCCATGACACCTATGAGTCCCCCACATCCTACATCCTGATCTTGGAACTG 

ATGGATGATGGCCGGCTCTTAGACTACCTTATGAATCATGATGAACTGATGGAGGAAAAA 

GTAGCTTTCTATATCCGAGACATCATGGAGGCTCTGCAGTACCTTCACAACTGCAGGGTT 

GCACATTTGGACATAAAGCCTGAAAACCTGCTCATTGACCTACGGATTCCAGTGCCTCGA 

GTGAAGCTCATTGACTTGGAGGATGCTGTCCAGATCTCGGGTCACTTCCACATTCACCAC 

CTGCTGGGGAACCCTGAGTTTGCTGCCCCAGAAGTCATTCAAGGCATCCCCGTCTCCCTG 

GGGACAGACATCTGGAGCATCGGGGTTCTGACATATGTCATGCTGAGTGGGGTCTCCCCC 

TTCTTGGATGAGAGCAAAGAGGAGACATGTATCAACGTATGCAGGGTGGATTTCAGCTTC 

CCCCATGAATACTTCTGTGGTGTGAGCAATGCTGCCAGAGATTTCATCAATGTGATCTTA 

CAGGAAGATTTTCGGAGGCGGCCCACAGCAGCCACATGCTTGCAGCATCCATGGCTGCAG 

CCCCATAATGGCAGCTACTCTAAGATCCCCCTGGACACCTCCCGCCTAGCATGCTTCATA 

GAACGTCGCAAGCACCAGAATGATGTGCGGCCTATCCCCAATGTCAAGAGCTACATTGTC 

AACCGGGTGAACCAAGGGACGTAG 

SEQ ID NO: 45_5R72_8_2_H 

CGCCGCTGTTTGTCCTCGCGCGGCCCCGTCCACTGCCCTGCGGTTGCTCTGCGGGCTGAA 
AAGTTTCTCCCGGTGCAGAATTCCGGGCTCAGCGACAGCCTGCGCCGAGTGTGCGCACCT 
GTCGGAGACCCGCCAGTCCGCCGGCCCCGGCTTTGTTCGTGCGGAACTGTAGTGGTGAGA 
AAAACTCCATGTCTGGGCACGCCTGGCTGATCTTCACCTCTTTCTTCTAGGACCTTCCTC

TGGGCTGTCACGTGTGAATATGTGTCTAGTGCATCCTTAACCTGAGGACTTCACCAGTTC 

GAAATTACAGTTTTCACCATCAACTACCTTATCCTTTTTGGCCTGGTTTTCTTCCTCAAA 

CAGTGGAAACATTTTTAAAGTTGCTTTTGTTGCAGAGTTAAACAAATGGCTGATAGTGGC 

TTAGATAAAAAATCCACAAAATGCCCCGACTGTTCATCTGCTTCTCAGAAAGATGTACTT 

TGTGTATGTTCCAGCAAAACAAGGGTTCCTCCAGTTTTGGTGGTGGAAATGTCACAGACA 

TCAAGCATTGGTAGTGCAGAATCTTTAATTTCACTGGAGAGAAAAAAAGAAAAAAATATC 

AACAGAGATATAACCTCCAGGAAAGATTTGCCCTCAAGAACCTCAAATGTAGAGAGAAAA 

GCATCTCAGCAACAATGGGGTCGGGGCAACTTTACAGAAGGAAAAGTTCCTCACATAAGG 

ATTGAGAATGGAGCTGCTATTGAGGAAATCTATACCTTTGGAAGAATATTGGGAAAAGGG 

AGCTTTGGAATAGTCATTGAAGCGACAGACAAGGAAACAGAAACGAAGTGGGCAATTAAA 

AAAGTGAACAAAGAAAAGGCTGGAAGCTCTGCTGTGAAGTTACTTGAACGAGAGGTGAAC 

ATTCTGAAAAGTGTAAAACATGAACACATCATACATCTGGAACAAGTATTTGAAACGCCA 

AAGAAAATGTACCTTGTGATGGAGCTTTGTGAGGATGGAGAACTCAAAGAAATTCTGGAT 

AGGAAAGGGCATTTCTCT^GAGAATGAGACAAGGTGGATCATTCAAAGTCTCGCATCAGCT 

ATAGCATATCTTCACAATAATGATATTGTACATAGAGATCTGAAACTGGAA7ATATAATG 

GTTAAAAGCAGTCTTATTGATGATAACAATGAAATAAACTTAAACATAAAGGTGACTGAT 

TTTGGCTTAGCGGTGAAGAAGCAAAGTAGGAGTGAAGCCATGCTGCAGGCCACATGTGGG

ACTCCTATCTATATGGCCCCTGAAGTTATCAGTGCCCACGACTATAGCCAGCAGTGTGAC 

ATTTGGAGCATAGGAGTCGTAATGTACATGTTATTACGTGGAGAACCACCCTTTTTGGCA 

AGCTCAGAAGCGAAGCTTTTTGAGTTAATAAGAAAAGGAGAACTACATTTTGAAAATGCA 

GTCTGGAATTCCATAAGTGACTGTGCTAAAAGTGTTTTGAAACAACTTATGAAAGTAGAT 

CCTGCTCACAGAATCACAGCTAAGGAACTACTAGATAACCAGTGGTTAACAGGCT^TAAA 

CTTTCTTCGGTGAGACCAACCAATGTATTAGAGATGATGAAGGAATGGAAAAATAACCCA 

GAAAGTGTTGAGGAAAACAC^CAGAAGAGAAGAATAAGCCGTCCACTGAAGAAAAGTTG 

AAAAGTTACCAACCCTGGGGAAATGTCCCTGAGACCAATTACACTTCAGATGAAGAGGAG 

GAAAAACAGTCTACTGCTTATGAAAAGCAATTTCCTGCAACCAGTAAGGACAACTTTGAT 

ATGTGCAGTTCAAGTTTCACATCTAGCAAACTCCTTCCAGCTGAAATCAAGGGAGAAATG 

GAGAAAACCeCTGTGACTCCAAGCCAAGGAACAGCAACCAAGTACCCTGCTAAATCCGGC 

GCCCTGTCCAGAACCAAAAAGAAACTCTAAGGTTCCCTCCAGTGTTGGACAGTACAAAAA 

CAAAGCTGCTCTTGTTAGCACTTTGATGAGGGGGTAGGAGGGGAAGAAGACAGCCCTATG 

CTGAGCTTGTAGCCTTTTAGCTCCACAGAGCCCCGCCATGTGTTTGCACCAGCTTAAAAT 

TGAAGCTGCTTATCTCCAAAGCAGCATAAGCTGCTVCATGGCATTAAAGGACAGCC^^ 

TAGGCTTGGCAGTGGGCTGCAGTGGAAATCAACTCAAGATGTACACGAAGGTTTTTTAGG 

GGGGCAGATACCTTCAATTTAAGGCTGTGGGCACACTTGCTCATTTTT^^ 

TATGTTTAGGCACAGCTATTTATAGGGGAAAACAAGAGGCCAAATATAGTAATGGAGGTG 

CCAAATAATTATGTGCACTTTGCACTAGAAGACTTTGTTAGAAAATTACTAATAAACTTG 

CCATACGTATTACAGCAGAAGTGCTTCAGTCATTCACATGTGTTCGTGAGATTTTAGGTT 

GCTATAGATTGTTTAAGACAGCTTATTTTAAATGTAGAAAAATAGGAGATTTTGTAACTG 

CTTGCCATTAACTTGCTGCTAAATTCCCAATGTATTGATTAAATCAATAAAAAACAG^ 

TTACTC 

SEQ ID NO: 46_SGK309_H 

GGGTCCGCAGCCCGCCCTCACAGGCCCTCCTCACTCCCCTAGGTAGATGGCCCCCTCAGG 
GCAGGCCCGGCGGACACCCCTCCCTCTGGCTGGCGGATGCAGTGCCTAGCGGCCGCCCTT 
AAGGACGAAACCAACATGAGTGGGGGAGGGGAGCAGGCCGACATCCTGCCGGCCAACTAC 
GTGGTCAAGGATCGCTGGAAGGTGCTGAAAAAGATCGGGGGCGGGGGCTTTGGTGAGATC 
TACGAGGCCATGGACCTGCTGACCAGGGAGAATGTGGCCCTCAAGGTGGAGTCAGCCCAG 
CAGCCCAAGCAGGTCCTCAAGATGGAGGTGGCCGTGCTCAAGAAGTTGCAAGGTTCGGGC 
CTCGGGCAGGGGGATGGGAAGGAAGAGATGATGAAGCCAGGGGCTAAGAGAGGGAAGGAC 
CATGTGTGCAGGTTCATTGGCTGTGGCAGGAACGAGAAGTTTAACTATGTAGTGATGCAG
CTCCAGGGCCGGAACCTGGCCGACCTGCGCCGTAGCCAGCCGCGAGGCACCTTCACGCTG 
AGCACCACATTGCGGCTGGGCAAGCAGATCTTGGAGTCCATCGAGGCCATCCACTCTGTG 
GGCTTCCTGCACCGTGACATCAAGCCTTCAAACTTTGCCATGGGCAGGCTGCCCTCCACC 
TACAGGAAGTGCTATATGCTGGACTTCGGGCTGGCCCGGCAGTACACCAACACCACG^^ 
GATGTGCGGCCCCCTCGGAATGTGGCCGGGTTTCGAGGAACGGTTCGCTATGCCTCAGTC 
AATGCCCACAAGAACCGGGAGATGGGCCGCCACGACGACCTGTGGTCCCTCTTCTACATG 
CTGGTGGAGTTTGCAGTGGGCCAGCTGCCCTGGAGGAAGATCAAGGACAAGGAACAGGTA 
GGGATGATCAAGGAGAAGTATGAGCACCGGATGCTGCTGAAGCACATGCCGTCAGAGTTC 
CACCTCTTCCTGGACCACATTGCCAGCCTCGACTACTTCACCAAGCCCGACTACCAGTTG 
ATCATGTCAGTGTTTGAGAACAGCATGAAGGAGAGGGGCATTGCCGAGAATGAGGCCTTT 
GACTGGGAGAAGGCAGGCACCGATGCCCTCCTGTCCACGAGCACCTCTACCCCGCCCCCA 
GCAGAACACCCGGCAGACGGCAGCCATGTTTGGGGTGGTCAATGTGACGCCAGTGCCTGG 
GGACCTGCTCCGGGAGAACACCGCGGATGTGCTACAGGGAGAGCACCTGAGTGACCAGGA 
GAATGCACCCCCAATTCTGCCCGGGAGGCCCTCTGAGGGGCTGGGCCACAGTCCCCACCT 
TGTCCCCCACCCCGGGGGTCCTGAGGCTGAAGTCTGGGAGGAGACAGATGTCAACCGGAA 
CAAACTCCGGATCAACATCGGCAAAGTAACTGCCGCCAGGGCGAAGGGCGTGGGTGGCCT 
TTTCTCTCACCCCCGATTCCCAGCCTTGTGCCCCTGCCCTGTTCCTCCTAAGCACCCTGT 
CCCCCGCCAATCTCCCTGCTTGCCCGGCCTCTGTTTCCGGTCCCCTCCCCGGCACTAGCC 
TCGCTGTGTCTTCCATCATCATCATCCTCTGTCTCCTTCACACTGAGGAGACCATCCGCC 

SEQ ID NO: 47_AA234451_H 

GGCGCGCCAGATATCACACGTGCCAAGGGGCTGGCTCAGCGGCGGCGGCGGCAGGAGGGG 

GAGCT^GGTGCTGGCACAAGAGCAGCGGCTTGGGGGAGCCGGCAGCAGCAGTAACAGCAG^ 

AGCAGCCGCCGCCGCCGCCGCCAGTAAACGCGGACCGTACCCCAGGGGACTACCCAGCCG 

GCCGGCCCTGGAAGCCGCGCTCGGGTCCCGCCGCAGTCGGCGGTGGGGGATGGGCAGGCA 

GTGGCGGTCCCGCCTGCCGAGGGTTAACCCCCGCCGGTCCCGGTCCTGAGCTGGACCAGA 

GCCCTCCTCCAGAAACCCCTGCGTCCGCCACGGCCCAGGTTAAATGGAAACCACCCTTGG 

GAACTGGATGCCTGTGTAGCTGTTCTACCATATCAGTGTATTGCAATGAGTGGGGGAGGA 

GAGCAGCTGGATATCCTGAGTGTTGGAATCCTAGTGAAAGAAAGATGGAAAGTGTTGAGA 

AAGATTGGGGGTGGGGGCTTTGGAGAAATTTACGATGCCTTGGACATGCTCACCAGGGAA 

AATGTTGCACTGAAGGTGGAATCAGCTCAACAACCAAAACAAGTTCTGAAAATGGAA^ 

GCTGTTTTGAAAAAGCTGCAAGGGAAAGACCATGTTTGTAGATTTATTGGCTGTGGGAGG 

AATGATCGATTCAACTATGTGGTCATGCAGTTGCAGGGTCGGAATCTGGCAGATCTTCGC 

CGTAGCCAGTCCCGAGGCACATTCACCATTAGTACCACTCTCCGGCTGGGTAGACAGATT 

TTGGAGTCTATTGAAAGCATTCATTCTGTGGGATCTTGNCATCGAGACATCAAACCGTCG 

AACTTCGCTATGGGTCGCTTTCCTAGTACATGTAGGAAATGTTACATGCTTGATTTTGGC 

TTGGCTCGACAATTTACCAATTCCTGTGGTGACGTCAGACCACCTCGAGCTGTGGCAGGT 

TTTCGAGGGACAGTTCGTTATGCATCAATCAACGCTICATCGGAACAGGGAAATG 

a^TGATGACCTTTGGTCCTTATTCTACATGTTGGTGGAGTTTGTGGTTGGTCAGCTGCCC 

TGGAGAAAAATAAAGGACAAGGAGCAAGTAGGCTCTATTAAGGAGAGATATGACCACAGG 

CTCATGTTGAAACATCTCCCTCCAGAATTCAGCATCTTTCTAGACCATATCTCTTCTTTG 

GATTATTTTACAAAACCAGACTACCAGCTTCTTACATCCGTGTTTGACAATAGCATCAAG 

ACTTTTGGAGTAATTGAGAGTGACCCTTTTGACTGGGAGAAGACTGGAAATGATGGCTCC 

CTAACAACCACCACTACTTCTACCACCCCTCAGTTGCACACTCQCTTGACCCCTGCTGCA 

ATTGGAATTGCCAATGCTACTCCCATCCCTGGAGACTTGCTTCGAGAAAATACAGATGAG 

GTATTTCCAGATGAACAGCTTAGCGATGGAGAAAATGGCATCCCTGTTGGTGTGTCACCA 

GATAAATTGCCTGGATCTCTGGGACACCCCCGTCCCCAGGAGAAGGATGTTTGGGAAGAG 

ATGGATGCCAACAAAAACAAGATAAAGCTTGGAATTTGTAAGGCTGCTACTGAAGAGGAG 

AACAGCCATGGCCAGGCAAATGGTCTTCTCAATGCTCCAAGCCTTGGGTCACCAATTCGT 




WO 00/73469

PCT/US00/14842

FIGURE 2MM

GTCCGCTCAGAGATTACTCAGCCAGACAGAGATATTCCACTGGTGCGAAAGTTACGTTCC 
ATTCACAGCTTTGAGCTGGAAAAACGTCTGACCCTGGAGCCAAAGCCAGACACTGACAAG 
TTCCTTGAGACCTGGTATAAAATAGTGTATTTTTCTTTTTAAAGCTTCTAAGGTACCATT 
ATTATTGTTGTCATTGTTGTTATTATTATTGTATATTTCTGTTACATAAAGTCTTTCAAA 
TAAGAAATCCTTGCATTTTTGTAACACTGAGTCTATTCAGCTCCAATTTTCATCCATGTT 
TTTAATTATTATTATCCTGATTCTTAATTATTATAAATTCTATAGCATATCCTTTGGCTT 
TGGAAGCTGAGCAGTAAGAGCTGATGACTTCCTAACACTAGGTACAAGTTAAATGAACAT 
TTTTACAGTAACTTTGTTTAGAAAGTAATCTCTTCCACACAACAGTGTAGTGCTGGAGAG 
GGCATGATAAAGATGGCATTAGGCAGAGATGAGGGGAATACATAAAGGAGGGGAAAAAGT 
AATTCATACACAAGGGACGGTGAGTTCAATTCACTTTAGTGAAGACCCTCTAGGAGTAAG 
ATACTGTGGGAAAACAGATACCAATAAGTATATCATGCTTGCCCTAGAGAGTTTGCAATC 
TACCTAGAGAGAAAGGAAGGTGAAACTTGAGAGATCTATATACATAGGTAAAGATTGTAG 
TGCATGGTTTTGAGGCACATTATCCCTACAACAAATTTTGATAACAGAAGAC 

SEQ ID NO: 48_AA435956_H 

ACTTTTACTATATTCTTTGAGATGACTGTTTTTGATTTAGAGGCGAAATCAGCACGTGGT 

GGCTCAAATCTCCTTATGGATAGTGTTTCTTCCTTCCTIGCTTTTCATGTTTCAACTTTTG 

CGGGGCCTGGCGTACATCCACCACCAACACGTTCTTCAC7VGGGACCTGAAACCTCAGAAC 

TTACTCATCAGTCACCTGGGAGAGCTCAAACTGGCTGATTTTGGTCTTGCCCGGGCCAAG 

TCCATTCCCAGCCAGACATACTCTTCAGAAGTCGTGACCCTCTGGTACCGGCCCCCTGAT 

GCTTTGCTGGGAGCCACTGAATATTCCTCTGAGCTGGACATATGGGGTGCAGGCTGCATC 

TTTATTGAAATGTTCCAGGGTCAACCTTTGTTTCCTGGGGTTTCCT^CATCCTTGAACAG 

CTGGAGAAAATCTGGGAGGTGCTGGGAGTCCCTACAGAGGATACTTGGCCGGGAGTCTCC 

AAGCTACCTAACTACAATCCAGAATGGTTCCCACTGCCTACGCCTCGAAGCCTTCATGTT 

GTCTGGAACAGGCTGGGCAGGGTTCCTGAAGCTGAAGACCTGGCCTCCCAGATGCTAAAA 

GGCTTTCCCAGAGACCGCGTCTCCGCCCAGGAAGCACTTGTTCATGATTATTTCAGCGCC 

CTGCCATCTCAGCTGTACCAGCTTCCTGATGAGGAGTCTTTGTTTACAGTTTCAGGAGTG 

AGGCTAAAGCCAGAAATGTGTGACCTTTTGGCCTCCTACCAGAAAGGTCACCACCCAGCC 

CAGTTTAGCAAATGCTGGTGAAAAGAAAGGGCGAGATCACCAAGGTTCTTCCAGGGCTGT 

ATTTCTGCAGTTTCGGTTTTCATTTGCTTCAGCTTACTAAGAAGCTTCAAATCTAACTCC 

ATACTGAACAAGGGGCTTTATGTCCTCACCTATGACCTGGAATAGTTTAAATATGGTGTT 

CAAGGCAATAGTACATAATAGTGGAAGAAAATTCTVGTGGAAGGTTATTGCTATTGTC^ 

TGCATAGAATTTAAGTGATTGATTTAAAAAAACTGGACATAAACTAAGTCTAAGAAG 

SEQ ID NO: 49_AA626859_H

AAATGGAGTTGCTGATGGAGTGATCAAAAGCGTATTATGGCAAACACTTCAAGCTCTTAA 

TTTCTGTCATATACATAACTGTATTCACAGAGATATAAAACCTGAAAATATTCTAATAAC 

TAAGCAAGGAATAATCAAGATTTGTGACTTCGGGTTTGCACAAATTCTGATTCCAGGAGA 

TGCCTACACCGATTATGTAGCTACGAGATGGTACCGAGCTCCTGAACTTCTTGTGGGAGA 

TACTCAGTATGGTTCTTCAGTCGATATATGGGCTATTGGTTGTGTTTTTGCAGAGCTCCT 

GACAGGCCAGCCACTGTGGCCTGGAAAATCAGATGTGGACCAACTTTATCTGATAATCAG 

AACACTAGGAAAATTAATCCCAAGACATCAATCAATCTTTAAAAGTAACGGGTTT^ 

TGGCATCAGTATACCTGAGCCAGAAGACATGGAAACTCTTGAGGAAAAGTTCTCAGATGT 

TCATCCTGTGGCTCTGAACTTCATGAAGGGGTGTCTGAAGATGAATCCAGATGACAGATT 

AACCTGTTCCCAACTCCTGGAGAGCTCCTACTTTGATTCTTTTCAAGAGGCCCAAATTAA 

AAGAAAAGCACGTAATGAAGGAAGAAACAGAAGACGCCAACAGAATCAACTGTTGCCTCT 

CATACCAGGAAGCCACATCTCCCCCACACCTGATGGAAGAAAACAAGTCCTCCAGTTAAA 

ATTTGATCACCTTCCAAACATTTAGGAAAATGTTCTTTCAAGTGCAAAGTAATTTAATAT 

GTACACATTTTGTACAAGTGAGATAGGAATTCTCAGTGTTTCAAATGCAAATGAGCCATA 
TGAAAATTAAGATGCCTTCTAGAATrGGTTTGCTCTGATCATTGCTGATTCCTTTCCCCA
TGCTTTTACAT 

SEQ ID NO: 50_AA061797_M

GAAAATAGCCCTGCGGGAAATCCGTATGCTGAAGTTGAAACACCCAAACCTCGTGAACCT 

CATCGAGGTGTTCAGAAGAAAGAGAAAGATGCATCTAGTTTTTGAGTACTGTGATCACAC 

ACTGTTAAACGAGCTGGAGAGAAACCCAAACGGAGTTTCTGATGGAGTGATTAAAAGTGT 

GCTATGGCAAACCCTTCAAGCCCTTAACTTCTGTCACAAGCACAATTGTATTCATCGGGA 

TGTAAAACCTGAAAACATCCTAATAACCAAGCAAGGGATGATAAAGATTTGTGACTTTGG 

ATTTGCACGAATTCTAATTCCAGGAGACGCCTACACAGACTATGTTGCCACCAGGTGGTA 

CCGAGCCCCCGAACTTCTCGTGGGAGACACGAAGTACGGTTCCTCTGTAGACGTGTGGGC 

CGTCGGCTGTGTTTTTGCAGAGCTCCTGACGGGTCAGCCACTCTGGCCGGGAAAATCCGA 

CGTGGACCAGCTTTACCTGATCATCAGGACGTTGGGGAAGCTGATTCCAAGACACeAGTC 

TATCTTTAGGAGTAACCAGTTTTTCCGCGGCATCAGCATACCTGAACCAGAGGACATGGA 

GACTCTTGAAGAAAAATTCTCAAATGTTCAGCCTGTGGCTTTAAGTTTCATGAAGGGATG 

CCTGAAGATGAATCCTGATGAGAGGCTGACCTGTGCCCAGCTGCTGGACAGTGCCTACTT 

TGAGTCTTTTCAAGAGGATCAAATGAAAAGAAAAGCCCGCAGTGAGGGGAGAAGCCGAAG 

GCGCCAGCAGAATCAACTGCTGCCTCTTATTCCTGGAAGCCACATCTCCCCCACACCTGA 

TGGAAGGAAACAAGTCGTCCAGTTAAAGTTCGATCATCTTCCAAACATTTAGGGGACTC 

TCCTTCCCAGCACATCCTTTTAATATTGTCTACATAGGAATAAGACGGGAATCCTCAGCA 

TCTCAAATACAGTGAGCGACGTGAACACCAGGGCACCTCTAATCACCACGGGCTCCTCCC 

CTGTGCTTTTTCCACGCCAGCTCCATCTCCTAAAACATTCTCTTTAAATGTTGCAGTATC 

AAAATGGCACATCCGAAAGAGATGCTTCCAGTTTCACCAGAGCCGGGCTTCCTCAGGCAA 

TCGGTACTGTGCATCTGTGGACTTATGCTCCGACCTAGGGAAAGATTTCCACGTAGCCGT 

CTCACTTCAGCCGACCAGTGGTGTCCTGAAGCAGACCCAGATCTGCTGGCTGCTGTTTGT 

GGAGGGGATGGCCCTGAGCCCTCTCACTGGAGTTTCTTCTCCGTGCAGCCAGGTCTTACT 

TTAGACTAaVTTTGTGTTATTGTGGCATGGCAATCGTGAAAGGTGGTCTAGGTTTACCCT 

TGACTCCACAGCAGATGCTAGTCTCCTTCTCGTGAGGAGCTGACAAGTCTGCTTCTAAAA 

CGAACTAGAGAAAATTCCAAACGTGACCAGTTAGTGGACAGACTACAAGGAATCGACCAC 

CATACCACAGTAACGCCCCTGGATCCCTGGCTGCCCACCCACTCTAAGGCTATCCTGGTT 

CACCATGGTTTCTCTTTCTTTTCTTCTTTTTTAATCTATTTGTACATATGAGAAAGAGGC 

AGAGGGGCGAGAGAAACCTCGTGTGTGAAAATCAAAAGACAAGCAGGAGGCCAGCCTAAG 

CTACATAGCAAGGCCTTTTCTCTACACCCATTCTCTAAGGTTGCTTAAACCCA^ 

GCTGCTGATTGTATAAACTATGAATAAGTTCTACATATGTAGGACATATTGTTGTCATTG 

TTGAAATATCTAAGGATCTTGGTAGAAGCAGAAGTGTTCTAAATATTCTCCACACTGGTG 

AGTATCTTGGCATTTCATTTCTGACCTCATCACAGATGAACACATCAAAGGATGAGTATG 

TATCACTTTGCATCTTAGAATTCTACCTGTTTTAGCTGCGTTAAACCTTGTGAAAGGGCG

GGGCCATAACTGAACCTGTGGAGTTCTTGCCTGTGTGCAGGAAACCCTCTGGTTTTGTCT

ccagcatggaagaaaacagctatagtcacacctacctgaaagtagaaattcaaagtc^ 

GTCCTTGACTACATATGCAGTCCAAGGCCACGCTGGGCTACACTTCTCCAGGCATGAAGG

TCCGTGTTTGTATCAAGGGGCAGGAAAGGAGAGTCCAAGGTCAAGGCCAGCCGAGGCTGC

ATAGTGAGTTGAGGCTCTTCAGCAAAAGAAAAGCAAACTAATAGGAGTCGTTGAAGGTAG

CCACCGGCCATTTCTCTAAATATCATTCTGCTGAAAAGGGGGCTTAGTTTAGTTTAGAAT

gcatituitgtatgtagaagctgggctatttcagattatttgaaattgtagctattgtt;^ 

TTAGCACTTAATAACTAACTAGCATTATGGTAGTCTAAACTATTAGAGTTTACTACAAAG

aggttttgattgaattatattaaactitataatatggattttaaaaatttaagatgtttaa 

GAAAGCTATATAAAGATTAAACATTTTTGTGGCTGTATATTTGTGTATATACCTTGGTTG

TTCTTTAAATTATTTTAATAAAAGCCAGAAACATT

SEQ ID NO: 51_AA397553_H

ATGCCCAATTCAGAGAGACATGGGGGCAAGAAGGACGGGAGTGGAGGAGCTTCTGGAACT 

TTGCAGCCGTCATCGGGAGGCGGCAGCTCTAACAGCAGAGAGCGTCACCGCTTGGTATCG 

AAGCACAAGCGGCATAAGTCCAAACACTCCAAAGACATGGGGTTGGTGACCCCCGAAGC^ 

GCATCCCTGGGCACAGTTATCAAACCTTTGGTGGAGTATGATGATATCAGCTCTGATTCC 

GACACCTTCTCCGATGACATGGCCTTCAAACTAGACCGAAGGGAGAACGACGAACGTCGT 

GGATCAGATCGGAGCGACCGCCTGCACAAACATCGTCACCACCAGCACAGGCGTTCCCGG 

GACTTACTAAAAGCTAAACAGACCGAAAAAGAAAAAAGCCAAGAAGTCTCCAGCAAGTCG 

GGATCGATGAAGGACCGGATATCGGGAAGTTCAAAGCGTTCGAATGAGGAGACTGATGAC 

TATGGGAAGGCGCAGGTAGCCAAAAGCAGCAGCAAGGAATCCAGGTCATCCAAGCTCCAC 

AAGGAGAAGACCAGGAAAGAACGGGAGCTGAAGTCTGGGCACAAAGACCGGAGTAAAAGT 

CATCGAAAAAGGGAAACACCCAAAAGTTACTUU^CAGTGGACAGCCCAAAACGGAGAT 

AGGAGCCCCCACAGGAAGTGGTCTGACAGCTCCAAACAAGATGATAGCCCCTCGGGAGCT 

TCTTATGGCCT^GATTATGACCTTAGTCCCTCACGATCTCATACCTCGAGCAATTATGAC 

TCCTACAAGAAAAGTCCTGGAAGTACCTCGAGAAGGCAGTCGGTCAGTCCCCCTTACAAG 

GAGCCTTCGGCCTACCAGTCCAGCACCCGGTCACCGAGCCCCTACAGTAGGCGACAGAGA 

TCTGTCAGTCCCTATAGCAGGAGACGGTCGTCCAGCTACGAAAGAAGTGGCTCTTACAGC 

GGGCGATCGCCCAGTCCCTATGGTCGAAGGCGGTCCAGCAGCCCTTTCCTGAGCAAGCGG 

TCTCTGAGTCGGAGTCCACTCCCCAGTAGGAAATCCATGAAGTCCAGAAGTAGAAGTCCT 

GCATATTCAAGACATTCATCTTCTCATAGTAAAAAGAAGAGATCCAGTTCACGCAGTCGT 

CATTCCAGTATCTCACCTGTCAGGCTTCCACTTAATTCCAGTCTGGGAGCTGAACTCAGT 

AGGAAAAAGAAGGAAAGAGCAGCTGCTGCTGCTGCAGCAAAGATGGATGGAAAGGAGTCC 

AAGGGTTCACCTGTATTTTTGCCTAGAAAAGAGAACAGTTCAGTAGAGGCTAAGGATTCA 

GGTTTGGAGTCTAAAAAGTTACCCAGAAGTGTAAAATTGQAAAAATCTGCCCCAGATACT 

GAACTGGTGAATGTAACACATCTAAACACAGAGGTAAAAAATTCTTCAGATACAGGGAAA 

GTAAAGTTGGATGAGAACTCCGAGAAGCATCTTGTTAAAGATTTGAAAGCACAGGGAACA 

AGAGACTCTAAACCCATAGCACTGAAAGAGGAGATTGTTACTCCAAAGGAGACAGAAACA 

TCTIGAAAAGGAGACCCCTCCACCTCTTCCCACAATTGCTTCTCCCCCACCCCCTCTACCA 

ACTACTACCCCTCCACCTCAGACACCCCCTTTGCCACCTTTGCCTCCAATACCAGCTCTT 

CCACAGCAACCACCTCTGCCTCCTTCTCAGCCAGCATTTAGTCAGGTTCCTGCTTCCAGT 

ACTTCAACTTTGCCCCCTTCTACTCACTCAAAGACATCTGCTGTGTCCTCTCAGGCAAAT 

TCTCAGCCCCCTGTACAGGTTTCTGTGAAGACTCAAGTATCTGTAACAGCTGCTATTCCA 

CACCTGAAAACTTCAACGTTGCCTCCTTTGCCCCTCCCACCCTTATTACCTGGAGGTGAT 

GACATGGATAGTCCAAAAGAAACTCTTCCTTCAAAACCTGTGAAGAAAGAGAAGGAACAG 

AGGAOVCGTCTICTTACTCACAGACCTTCCTCTCCCTCCAGAGCTCCCTGGTGGAGATCTG 

TCTCCCCCT^GACTCTCCT^GAACCAAAGGCAATCACACCACCTCAGCAACCATATA 

AGACCAAAAATTTGTTGTCCTCGTTATGGAGAAAGAAGACAAACAGAAAGCGACTGGGGG 

AAACGCTGTGTGGACAAGTTTGACATTATTGGGATTATTGGAGAAGGAACCTATGGCCAA 

GTATATAAAGCCAGGGACAAAGACT^CAGGAGAACTAGTGGCTCTGAAGAAG^ 

GACAATGAGAAAGAGGGCTTCCCAATCACAGCCATTCGTGAAATCAAAATCCTTCGTCAG 

TTAATCCACCGAAGTGTTGTTAACATGAAGGAAATTGTCACAGATAAAC7UVGATGCACTG 

GATTTCAAGAAGGACAAAGGTGCCTTTTACCTTGTATTTGAGTATATGGACCATGACTTA 

ATGGGACTGCTAGAATCTGGTTTGGTGCACTTTTCTGAGGACCATATCAAGTCGTTCATG 

AAACAGCTAATGGAAGGATTGGAATACTGTCACAAAAAGAATTTCCTGC^ 

AAGTGTTCTAACATTTTGCTGAATAACTVGTGGGCT^TCJ^CTAGCA 

GCTCGGCTCTATAACTCTGAAGAGAGTCGCCCTTACACAAACAAAGTCATTACTTTGTGG 

TACCGACCTCCAGAACTACTGCTAGGAGAGGAACGTTACACACCAGCCATAGATGTTTGG 

AGCTGTGGATGTATTCTTGGGGT^CTATTCACAAAGAAGCCTATTTTTCAAGCCAATCTG 

GAACTGGCTCAGCTAGAACTGATCAGCCGACTTTGTGGTAGCCCTTGTCCAGCTGTGTGG 

CCTGATGTTATCAAACTGCCCTACTTCAACACCATGAAACCGAAGAAGCAATATCGAAGG 
CGTCTACGAGAAGAATTCTCTTTCATTCCTTCTGCAGCACTTGATTTATTGGACCACATG

CTGACACTAGATCCTAGTAAGCGGTGCACAGCTGAACAGACCCTACAGAGCGACTTCCTT 

AAAGATGTCGAACTCAGCAAAATGGCTCCTCCAGACCTCCCCCACTGGCAGGATTGCCAT 

GAGTTGTGGAGTAAGAAACGGCGACGTCAGCGACAAAGTGGTGTTGTAGTCGAAGAGCCA 

CCTCCATCCAAAACTTCTCGAAAAGAAACTACCTCAGGGACAAGTACTGAGCCTGTGAAG 

AACAGCAGCCCAGCTVCCACCTCAGCCTGCTCCTGGCTU^GGTGGAGTCTGGGGCTGGGC^ 

GCAATAGGCCTTGCTGACATCACACT^CAGCTGAATCAAAGTGAATTGGCAGTGTTATTA 

AACCTGCTGCAGAGCCAAACCGACCTGAGCATCCCTCAAATGGCACAGCTGCTTAACAT^ 

CACTCCAACCCAGAGATGCAGCT^GCAGCTGGAAGCCCTGAACCAATCCATCAGTGCCCTG 

ACGGAAGCTACTTCCCAGCAGCAGGACTCAGAGACCATGGCCCCAGAGGAGTCTTTGAAG 

GAAGCACCCTCTGCCCCAGTGATCCTGCCTTCAGCAGAACAGATGACCCTTGAAGCTTCA 

AGCACACCAGCTGACATGCAGAATATATTGGCAGTTCTCTTGAGTCAGCTGATGAAAACC 

CAAGAGCCAGCAGGCAGTCTGGAGGAAAACAACAGTGACTVAGAACTVGTGGGCCAC^ 

CCCCGAAGAACTCCCACAATGCCACAGGAGGAGGCAGCAGCATGTCCTCCTCACATTCTT 

CCACCAGAGAAGAGGCCCCCTGAGCCCCCCGGACCTCCACCGCCGCCACCTCCACCCCCT 

CTGGTTGAAGGCGATCTTTCCAGCGCCCCCCAGGAGTTGAACCCAGCCGTGACAGCCGCC 

TTGCTGCAACTTTTATCCCAGCCTGAAGCAGAGCCTCCTGGCCACCTGCCACATGAGCAC 

CAGGCCTTGAGACCAATGGAGTACTCCACCCGACCCCGTCCAAACAGGACTTATGGAAAC 

ACTGATGGGCCTGAAACAGGGTTCAGTGCCATTGACACTGATGAACGAAACTCTGGTCCA 

GCCTTGACAGAATCCTTGGTCCAGACCCTGGTGAAGAACAGGACCTTCTCAGGCTCTCTG 

AGCCACCTTGGGGAGTCCAGCAGTTACCTVGGGCACy^GGGTCAGTGCAGTTTCCAGGGGAC 

CAGGACCTCCGTTTTGCCTIGGGTCCCCTTAGCGTTACACCCGGTGGTCGGGCAACCATTC 

CTGAAGGCTGAGGGAAGCAGCAATTCTGTGGTACATGCAGAGACCAAATTGCAT^CTAT 

GGGGAGCTGGGGCCAGGAACCACTGGGGCCAGCAGCTCAGGAGCAG^ 

GGCCCAACTCAGTCTTCTGCTTATGGAAAACTCTATCGGGGGCCTACAAGAGTCCCACCA 

AGAGGGGGAAGAGGGAGAGGAGTTCCTTACTAA 

SEQ ID NO: 52_AA789239_H 

TGAAAATGGAGATGTATGAAACCCTTGGAAAAGTGGGAGAGGGAAGTTACGGAACAGTCA 

TGAAATGTAAACATAAGAATACTGGGCAGATAGTGGCCATTAAGATATTTTATGAGAGAC 

CAGAACAATCTGTCAACT^AAATTGCGATGAGAGAAATAAAGTTTCTAAAGCAATTTC^ 

ACGAAAACCTGGTCAATCTGATTGAAGTTTTTAGACAGAAAAAGAAAA^^ 

TTGAATTTATTGACCACACAGTATTAGATGAGTTACAACATTATTGTCATGGAC 

GTAAGCGACTTAGAAAATACCTCTTCCAGATCCTTCGAGCAATTGACTATCTTCACAGT^ 

ATAATGTAATCATTCATCGAGATATAAAACCTGAGAATATTTTAGTATCCCAGTCAGGAA 

TTACTAAGCTCTGTGATTTTGGTTTTGCACGAACACTAGCAGCTCCTGGGGACATTTATA 

CGGACTATGTGGCCACACGCTGGTATAGAGCTCCCGAATTAGTATTAAAAGATACTTCTT 

ATGGAAAGTATGTGCCTGTGGATATCTGGGCTTTGGGCTGTATGATCATTGAGATGGCCA 

CTGGAAATCCCTATCTTCCTAGTAGTTCTGATTTGGATTTACTCCATAAAATTGTTTTGA 

AAGTGNGATTCATGCCAGAACTGAAAGCTAAATTACTGCAGGAAGCAAAAGTCT^TTC^ 

TAATAAAGCCAAAAGAGAGTTCTAAAGAAAATGAACTCT^GGAAAGATGAAAGAAAAACAG 

TTTATACCAATACACTGCTAAGTAGTTCAGTTTTGGGAAAGGAAATAGAAAAAGAGAAAA 

AGCCCAAGGAGATCAAAGTCAGAGTTATTAAAGTCAAAGGAGGAAGAGGAGATATCTCAG 

AACCAAAAAAGAAAGAGTATGAAGGTGGACTTGGTCAACAGGATGCAAATGAAAATGTTC 

ATCCTATGTCTCCAGATACTVAAACTTGTAACCATTGAACCACCAAACCCTATC^ 

GCACTAACTGTAATGGCTTGAAAGAAAATCCACATTGCGGAGGTTCTGTGACAATGCCAC 

CCATCAATCTAACTAACAGTAATTTGATGGCTGCAAATCTCAGTTCT^TCT 

CCAGTGTGAGGTTAACTGAAAGAGCAAAAAAGAGACGCy^CTTCTTCACTATCTATTGG 

AAGTTATGCCTAATAGCAGGCAAGAGGATCCAGGTCCTATTCAAAGCCAAATGGAGAAGG 

GTATATTTAATGAGCGAACAGGTCACAGTGACCAAATGGCAAATGAGAACAAAAGG^ 
TGAATTTTTCCAGATCTGACAGGAAAGAATTCCATTTTCCAGAATTGCCTGTCACAATAC
AGTCAAAAGATACAAAAGGAATGGAAGTTAAACAGATAAAAATGCTGAAGAGGGAGTCAA 
AGAAAACAGAGTCATCTAAGATACCAACTTTACTTAACGTGGATCAAT^TCAAGAAAAAC 
AAGAGTTTATTCCCTTATCTCTGCTGTCTGCCTGCTGTCCTATTTTCACAAATATTTGCT 
CTCAGCTAACTATCAGGGTGGAGATGGCCATTGCGAGGGGAAGAATTTGAAGAGAAACAG 

TTCCCGTTTTATTTATCTACT^TAGAAGTGTGATGTGAGTTGTTGTTAAGACAGCCATCC 
ATGTGCATGAGCATCATCCAGCTTTTTTTGTTAGCAAAACATTTACTGTTTTCTTTTCCC 
TTTTAAGACTCTGTTGATGTGATAATTTGATTTGGAATTATAAAGTCATCTCTTCTCTGC 

CTTGAA 

SEQ ID NO: 53_AA124976_M 

CTGGCAGATATAGTTCATGCTTGTTTACAAATTGATCCTGCTGAGAGGACATCATCTACT 

GATCTTTTGCGTCACGATTACTTTACTAGAGATGGATTTATTGAGAAATTCATACCAGAG 

CTGAGAGCTAAATTATTACAGGAAGCAAAGGTTAATTCATTTATAAAGCCAAAAGAGAAT 

TTTAAAGAAAATGAACCTGTGAGAGATGAGAAGAAATCAGTTTTTACCAACACCCTGCTC 

TATGGATATCCATCTVCTTTATGGCT^GGAAGTGGACAGAGACAAAAGGGCCAAGGAG^^ 

AAAGTCAGAGTCATTAAGGCCAAAGGGGGCAAAGGAGATGTCCCAGACCAGAAGAAGCCA 

GAGTATGAAGGCGACCACCGCCAGCAGGGCACAGCTGATGACACACAGCCCTCATCTVCT^ 

GACAAGAAGCCTTCTGTCTTGGAACTGACAAACCCTCTCAATCCCAGTGAGAATTCTGAC 

GGTGTCAAAGAAGACCCACACGCTGGGGGTTGTATGATAATGCCACCTATCAACCTGACA 

AGCAGTAATTTGTTGGCCGCAAATCTCAGTTCAAACCTTTCCCACCCCAATT^^ 

ACTGAAAGAACAAAAAAGAGACGCACTTCTTCJ^CAAACTATTGGACAGACTTTGTCTAAT 

AGCAGACAAGAGGACACAGGTCCCACACAAGTCCy^CAGAGAAAGGTGCATTTAATGAG 

CGAACAGGTCAGAATGACCAAATATCGAGTGGGAACAAAAGAAAGCTGAATTTTCCCAAA 

TGCGACAGGAAAGAATTCCATTTCCCTGAACTGCCATTCACAGTGCAGGCGAAGGAGATG 

AAAGGGATGGAAGTTAAACAGATAAAAGTGCTGAAGAGAGAATCAAAGAAAACAGATTCA 

TCTAAAATACCAACTTTACTTAGTATGGACCCAAATCAAGAAAAACAAGAGGGTGGAGAT 

GGCGATTGTGAGGGGAAGAATTTGAAGAGGAACAGATTTTTTTTTTCCCGATAGTGCTTT 

GTCTTTTAAGTAATCTTAAAAATACAAGCTTGACAATTCCTTCCTTTTTATTTTATATAC 

ACTAGTU^TGTACT^TAGGTTGCTGCTAAGATAGCCACCCATCCCATCTGCATCAACATCAT 

CTATTTTTTTGGTTTTGCTAGCAAAATTTTCAC/^TTTTTCTCTATCTTCC^^ 

TATTTTGATGCTGTGATTTGAAATTATAAAGTCACCTCCTCTGTCTGCTTCCTTCCTTGC 

CATGATTACTGAGTGGGTAGTCACATGATGTGCCCTGCTCGCACTGCTCTCAGACTGCTG 

AGACTCAAACCTCATAAGCCAGGGGTCTCCTGGGAAGCACTGGCCTCTTCAAGTGGATGC 

TCGATGAACCTTCTTATCTGTTGTCTTAGTAACCACTCGTTGCCATCACATGATGAAAGA 

CATTCTATTGTCCCCAGTGAAGCATTTATAGTACTTACATAACATGTTACAGTGATATGA 

TGTTCCTAGGTTAAACTCCTTGAGATGAAACTATTTCCTGCATTCTCTGACTCCCCTAGT 

CTAATAGTTCCTTCCATTTAGCCAGAAGAATTTCCTGAAGAAGCGATGCACAACCTGGGA 

AAGGTTTACTTTCTATCCTGGGCTGTTTTCTGTTGCTAAATAATATAGACTGGGTAGTTA 

GTTAACAT 

SEQ ID NO: 54_AA575635_M, CCRK_M

AGCGCCTCAGGCCAGCTCAAGATAGCTGACTTTGGCCTGGCCCGGGTCTTCTCTCCGGAT 
GGTGGTCGCCTCTACACACATCAGGTGGCCACCAGGTGGTACCGAGCTCCTGAACTCCTG 
TATGGCGCTCGGCAGTATGACCAGGGCGTTGACCTATGGGCTGTGGGCTGCATCATGGGA 
GAGCTGTTGAATGGGTCCCCCCTGTTCCCGGGCGAAAACGACATTGAACAACTGTGCTGT 
GTGCTTCGCATCCTGGGTACCCCGAGTCCTCGAGTCTGGCCGGAGATCACAGAGCTGCCT 
GACTACAACAAGATCTCCTTCGAGGAGCAGGCACCAGTGCCCCTGGAGGAGGTGCTGCCT 
GATGCCTCTCCCCAGGCCTTGGACCTGCTGGGCCAGTTCCTCCTCTACCCTCCACGACAG 
CGTATTGCAGCCTCCCAGGCCCTTCTGCATCAGTACTTCTTCACAGCGCCTCTGCCTGCC

CATCCATCCGAGCTGCCAATTCCTCAGCGCCCAGGGGGACCTGCACCCAAGGCT 

GGGCCCCCCCATGTCCACGACTTCCATGTGGATCGACCTATTGAGGAGTCACTGTTGAAC 

CCAGAACTGATTCGGCCCTTCATCCCAGAGGGGTGAGATGCTGGTCCAGGCCTTCCTGCT 

CGCCCTAGGAGCACCTCTTTCTGATTTGCCTCCATGGCCTCCCCACGGCTATATATACCA 

CACCTGGTCCTGCTCCTGAGTGTGCTTGAGGGCTGGGCTCTGGGAGGCAGAACCGTGAGA 

TGTTCATCCCAGCAGAGAAAGAGACTCACGTCCTACAGACAAAGCCTCCAGAAACTGCTA 

GCTGTGTCCTTCTCCAGGGCCACCCCTCAGTGGTGCCACCCGGCCTTAGAGATGATTGTC 

AGGCTCTGTCCCCTCTTCAAGGACATTGGTACTACAGCACCACCTGGTGGAAGCACAGAG 

TATAAGCTGTCTTCATACTGGGGACACAGCTGGGAAGTCAGACATGTTTTAGTTTTGGTT 

CCACTGGGTCAGGATTTGAGGTTCATATAAAAGCCCTGGGTGTTTCTGTCTAATTGCACC 

TTGTCTGTTGCTGTTAGGGAAAGGACAATGGTGGGCCTTGATTCACAGGGGTCAGGTACT 

CAGAAGGGGCCTCCTGTGAAGGCCATTTGGGTCCTCAGGCTTCCCATGCTATTCACGGGA 

CTTGAGTGCTCATTTGGGAGCGAGGGTCCAGAAGCTGAGGCCCAGGGATGGACAGTCCAG 

TTCCCGAAGCCCACTTCCCACATGTCGGGTGGGTCAGTCAGTGAGCCTGAGGCTGCCTTG 

CAGATGCGGAAGCAGGCATTCCTGGAATCCACTCAGT7VAATAAATTCCAGTGTGACTCAG 

SEQ ID NO: 55_AA631990_H 

GAACAACAATAACAGAATAAGGAAGAAAATCTCATGATTACCTCAATAAGTAC^ 

TCTGGTCACACTCACTATCCATTCATGATTACAACTCTTCAATACTATCGCGGCCGAGGA 

GGGAAGACGGCAGTTTGGCGACATTTCTCGGCCGAAGGGCCATTTGCTTTTGCGGAGATG 

CGGCATTCCAAAAGAACTCACTGTCCTGATTGGGATAGCAGAGAAAGCTGGGGACAT 

AGCTATCGTGGAAGTCACAAGCGGAAGAGGAGATCTCATAGTAGCTICACAAGAGAAC^ 

CATTGTAAACCTVCATCACCAGTTTAAAGAATCTGATTGTCATTATTTAGAAGC^ 

TTGAATGAGCGAGATTATCGGGACCGGAGATACGTTGACGAATACAGGAATGACTACTGT 

GAAGGATATGTTCCTAGACATTATCACAGAGACATTGAAAGCGGGTATCGAATCCACTGC 

AGTAAATCTTCAGTCCGCAGCAGGAGAAGCAGTCCTAAAAGGAAGCGCAATAGACACTGT 

TCAAGTCATCAGTCACGTTCGNATGAAATCGTGGACACTTTGGGTGAAGGAGCCTTTGGC 

AAAGTTGTAGAGTGCATTGATCATGGCATGGATGGCATGCATGTAGCAGTGAAAATCGTA 

AAAAATGTAGGCCGTTACCGTGAAGCAGCTCGTTCAGAAATCCAAGTATTAGAGCACTTA 

AATAGTACTGATCCCAATAGTGTCTTCCGATGTGTCCAGATGCTAGAATGGTTTGATCAT 

CATGGTO^TGTTTGTATTGTGTTTGAACTACTGGGACITAGTACTTACGATTTCATTAAA 

GAAAACAGCTTTCTGCCATTTCAAATTGACCACATCAGGCAGATGGCGTATCAGATCTGC 

CAGTC7VATAAATTTTTTACATCATAATAAATTAACCCATACAGATCTGAAGCCTGAAAAT 

ATTTTGTTTGTGAAGTCTGACTATGTAGTCAAATATAATTCTAAAATGAAACGTGATGAA

CGOVCACTGAAAAACACAGATATCAAAGTTGTTGACTTTGGAAGTGCAACGTATGATGAT

GAACATCACAGTACTTTGGTGTCTACCCGGCACTACAGAGCTCCCGAGGTC7VTTTTGGCT

TTAGGTTGGTCTCAGCCTTGTGATGTTTGGAGCATAGGTTGCATTCTTATTGAATATTAC

CTTGGTTTCAGAGTCTTTOVGACTCATGATAGTAAAGAGCACCTGGCAATGATGG;^

ATATTAGGACCCATACCACAACACTITGATTCAGAAAACAAGAAAACGCA^

CATAACCAGCTAGATTGGGATGAACACAGTTCTGCTGGTAGATATGTTAGGAGACGCTGC

AAACCGTTGAAGGAATTTATGCTTTGTCATGATGAAGAACATGAGAAACTGTTTGACCTG

GTTCGAAGAATGTTAGAATATGATCCAACTCAAAGAATTACCTTGGATGAAGCT^TTGC^ 

CATCCTTTCTTTGACTTATTAAAAAAGAAATGAAATGGGAATCAGTGGTCTTACTATAT^

CTTCTCTAGAAGAGATTACTTAAGACTGTGTCAGTCAACTAAACATTCTAATATTTTTGT

AAACATTAAATTATTTTGTACAGTTAAGTGTAAATATTGTATGTTTTGTATCAATAGCA

AATTAACTTGTTAAGCAAGTATGGTCTTGATAATGCATTAGAAAAATTAAAATTAATTTT

TCTTTTTGAAATTACCATTTTTAAATACCTTTGAAATATCCTTTGTGTCCAGTGATAAAT

GTGATTGATCTTGCCTTTTGTACATGGAGGTCACCTCTGAAGTGATTTTTTTTGAGTAAA

AGGAAATCTTGACTACTTTATATTCTTAAAGGAATATTCTTTATATACTTCAAATTTAGA
ACTTAACTTTAAAAGTTTTTCTTCTGTAATTGTTGAACGGGTGATTATTATTAACTCTAG
ATAAGCAGGTACTAGAAACCAAAACTCAGAAAATGTTTACTGTTAGAATTCTATTAAATT 
TTAAGTGTTGTATTCTTTTTCATTGGGTGATGTCAGGGTGATAACCAGACATTCATGGAA 
AGGCATGCAGTTTGTCCATTGTGACAGTTTGTTTAATAAAACCACATACACACTTTATTT 
AAGATTAAAATCTAACTGGAAAGTCAGCTTGGAAAATGGACATTTCCAAGTATGTTTGGT 
GAGTCACAGATATAAAAATAGAAATTCTGATGAGAGGTTTCAGTTTTTAATACCAAGTCC 
TTAGGAGTCTTAACATTGGCCAGCATCTGTTTATCAAATGACATAAATACGTAAACCTAT 
AAGAATTAAGTTTATTAATTAGGCTATTTATGTCTGTGATAATTCTTACGGGAGAAAGAG 
GATTTGATTGGAAAGCAGTTTGGGAAGAAAGTGCTGCTGAAATTTCCAGAATTTAATTGA 
TTGGTTACAT7VAACTTTTTGACTTCAAT 

SEQ ID NO: 56_AA557536_H 

AGTAAGGCCCCGCGGGCGTCCTGGCCGCCATGTGCACCGTAGTGGACCCTCGCATTGTCC 

GGAGATACCTACTCAGGCGGCAGCTCGGGCAGGGGAGAACATTCCGGGAAATCACC^ 

TCCAGGTGAGTGGCCTGGGCCCTCCAGTCCAATCCCCTTGCCCAGGTACAGATCTCTCCA 

GAC7VGGAGAGAAACTGGCCTTCTTGGGCCCCAGAGCACAGCCCCTCCTGGCCTTCCAGCC 

GCCTCCGACTCTCTCCCCAGGAGTTTGGGGACCATCCCAACATCATCAGCCTCCTTGACG 

TGATCCGGGCAGAGAACGACAGGGACATTTACCTGGTGTTTGAGTTTATGGACACTGACC 

TGAACGCAGTCATCCGGAAGGGCGGCCTGCTGCAGGACGTCCACGTGCGCTCCATCTTCT 

ACCAGCTCCTGCGGGCCACCCGGTTCCTCCACTCGGGGCACGTTGTGCACCGGGACCAGA 

AGCCGTCCAATGTGCTCCTGGATGCCAACTGCACAGTGAAGCTGTGTGACTTTGGCCTGG 

CCCGCTCCCTGGGCGACCTCCCTGAGGGGCCTGAGGACCAGGCCGTGACAGAGTACGTGG 

CCACACGCTGGTACCGAGCACCGGAGGTGCTGCTCTCTTCGCACCGCTACACCGCTTCCT 

GCCCCAGATACACCCTTGGGGTGGACATGTGGAGTCTGGGCTGTATCCTGGGGGAGATGC 

TGCGGGGGAGACCCCTGTTCCCCGGCACGTCCACCCTCCACCAGCTGGAGCTGATCCTGG 

AGACCATCCCACCGCCATCTGAGGAGXXXAGGCCACGACAGACGCTGGATGCCCTCCTAC 

CGCCAGACACCTCCCCAGAGGCCTTGGACCTCCTTAGGCGACTCCTGGTGTTCGCCCCGG 

ACAAGCGGTTAAGCGCGACCCAGGCACTGCAGCACCCCTACGTGCAGAGGTTCCACTGCC 

CCAGCGACGAGTGGGCACGAGAGGCAGATGTGCGGCCCCGGGCACACGAAGGGGTCCAGC 

TCTCTGTGCCTGAGTACCGCAGCCGCGTCTATCAGATGATCCTGGAGTGTGGAGGCAGCA 

GCGGCACCTCGAGAGAGAAGGGCCCGGAGGGTGTCTCCCCAAGCCAGGCACACCTGCACA 

AACCCAGAGCCGACCCTCAGCTGCCTTCTAGGACACCTGTGCAGGGTCCCAGACCCAGGC 

CCCAGAGCAGCCCAGGCCATGACCCTGCCGAGCACGAGTCCCCCCGTGCAGCCAAGAACG 

TTCCCAGGCTVGAACTCCGCTCCCCTGCTCCAAACTGCTCTCCTAGGGAATGGGGAAAGGC 

CCCCTGGGGCGAAGGAAGCGCCCCCCTTGACACTCTCGCTGGTGAAGCCAAGCGGGAGGG 

GAGCTGCGCCCTCCCTGACCTCCCAGGCTGCGGCTCAGGTGGCCAACCAGGCCCTGATCC 

GGGGTGACTGGAACCGGGGCGGTGGGGTGAGGGTGGCCAGCGTAC7UVCAGGTCCCTCCCC 

GGCTTCCTCCGGAGGCCCGGCCCGGCCGGAGGATGTTCAGCACCTCTGCCTTGCAGGGTG 

CCCAGGGGGGTGCCAGGGCTTTGCTTGGAGGCTACTCCCAAGCCTACGGGACTGTCTGCC 

ACTCGGCACTGGGCCACCTGCCCCTGCTGGAGGGGCACCATGTGTGAGCCGCCCTACTCC 

CTTCACCTGGCCCTCTGTTCCTGCCCCAGCNCCTTCCCCAGACCCCTCTCCAGTCTCCTG 

CACCCCTTAGCCCTCCCTGCTTTGCCTGGCCCGTTGAAGTTCCAGGGAGCTTGCCCGGGT 

CTCCTCGGGGGAGCAGATGAGGGCCCTGCCC 

SEQ ID NO: 57_N28606_H, MOK_H 

ATGAAGAACTATAAAGCAATTGGCAAAATAGGAGAGGGAACGTTTTCTGAAGTTATGAAG 
ATGCAAAGCCTGAGAGATGGAAACTACTATGCATGTAAACAAATGAAGCAGCGCTTTGAA 
AGTATTGAGCAAGTCAACAACCTACGAGAGATCCAAGCACTGAGGCGCCTGAATCCGCAC 
CCAAACATTCTTATGTTGCATGAAGTGGTTTTTGACAGAAAATCTGGTTCTCTTGCACTA 
ATATGTGAACTTATGGACATGAATATTTATGAGCTAATACGAGGGAGAAGATACCCATTA 
TCAGAAAAAAAAATTATGCACTATATGTACCAGTTATGTAAGTCCCTGGATCATATTCAC
AGAAATGGAATATTTCACAGAGATGTAAAACCAGAAAATATACTAATAAAGCAGGATGTC 
CTGAAATTAGGGGACTTTGGCTCCTGCCGGAGTGTCTATTCCAAGCAGCCGTACACGGAA 
TACATCTCCACCCGCTGGTACCGGGCCCCGGAGTGTCTCCTCACTGATGGGTTCTACACG 
TACAAGATGGACCTGTGGAGCGCCGGCTGTGTGTTCTACGAGATCGCCAGTCTGCAGCCC 
CTCTTTCCTGGAGTAAATGAACTGGACCAAATCTCAAAAATCCACGATGTCATCGGCACA 
CCCGCTCAGAAGATCCTCACCAAGTTCAAACAGTCGAGAGCTATGAATTTTGATTTTCCT 
TTTAAAAAGGGATCAGGAATACCTCTACTAACAACCAATTTGTCCCCACAATGCCTCTCC 
CTCCTGCACGCAATGGTGGCCTATGATCCCGATGAGAGAATCGCCGCCCACCAGGCCCTG 
CAGCACCCCTACTTCCAAGAACAGAGGAAAACAGAGAAGCGGGCTCTGGGCAGCCACAGA 
AAAGCTGGCTTTCCGGAGCACCCTGTGGCACCGGAACCACTCAGTAACAGCTGCCAGATT 
TCCAAGGAGGGCAGAAAGCAGAAACAGTCCCTAAAGCAAGAGGAGGACCGTCCCAAGAGA 
CGAGGACCGGCCTATGTCATGGAACTGCCCAAACTAAAGCTTTCGGGAGTGGTCAGACTG 
TCGTCTTACTCCAGCCCCACGCTGCAGTCCGTGCTTGGATCTGGAACAAATGGAAGAGTG 
CCGGTGCTGAGACCCTTGAAGTGCATCCCTGCGAGCAAGAAGACAGATCCGCAGAAGGAC 
CTTAAGCCTGCCCCGCAGCAGTGTCGCCTGCCCACCATAGTGCGGAAAGGCGGAAGATAA 

SEQ ID NO: 58_AB023153_H, ICK_H 

ATGAATAGATACACAACAATCAGGCAGCTCGGGGArGGAACCTACGGTTCCGTCCTGCTG 
GGAAGAAGCATTGAGTCTGGGGAGCTGATCGCTATTAAAAAAATGAAAAGAAAATTTTAT 
TCCTGGGAGGAATGCTITGAACCAACGGGAGGTTAAGTCTTTAAAGAAGCTCAACCATGCC 
AATGTAGTCAAAITAAAAGAAGTTATCAGGGAAAATGATCATCTTTATTTTATCTTCGAG 
TACATGAAGGAAAATCTTTACCAGCTCATTAAAGAGAGAAATAAGTTGTTTCCTGAGTCT 
GCTATAAGGAATATCATGTATCAGATATTACAAGGACTCGCATTTATTCACAAACTCGGC 
TTCTTTCATCGAGACTTAAAGCCTGAGAACCTCCTCTGCATGGGACCAGAACTTGTGAAA 
ATTGCAGACTTTGGTTTGGCCCGAGAAATACGATCAAAACCTCCATATACAGATTATGTA 
TCTACCAGATGGTACAGGGCTCCAGAAGTACTCCTGAGGTCTACCAACTACAGCTCCCCC 
ATTGACGTCTGGGCGGTGGGCTGCATCATGGCAGAAGTTTACACCCTCAGGCCACTCTTC 
CCTGGAGCCAGTGAAATTGACACAATATTCAAAATTTGCCAAGTGCTGGGGACACCAAAA 
AAGACTGACTGGCCTGAAGGCTATCAACTTTCAAGTGCAATGAACTTCCGTTGGGCACAG 
TGTGTACCCAATAACTTAAAGACCTTGATTCCCAATGCTAGCAGTGAAGCAGTCCAGCTC 
CTGAGAGACATGCTTCAGTGGGATCCCAAGAAACGACCAACAGCTAGTCAGGCACTTCGA 
TATCCTTACTTCCAAGTTGGACACCCACTAGGCAGCACCACACAAAACCTTCAGGATTCA 
GAAAAACCACAGAAAGGCATCCTGGAAAGGGCAGGCCCACCTCCTTATATTAAGCCAGTC 
CCACCTGCCCAGCCACCAGCCAAGCCACACACACGAATTTCTTCACGACAGCATCAAGCC 
AGCCAGCCCCCTCTGCATCTCACGTACCCCTACAAAGCAGAGGTCTCCAGGACAGATCAC 
CCAAGCCATCTCCTIGGAGGACAAGCCAAGCCCGTTGCTTTTCCCATCCCTCCACAACAAG 
CATCCACAGTCGAAAATCACAGCTGGCCTGGAGCACAAAAATGGTGAGATAAAGCCAAAG 
AGTAGGAGAAGGTGGGGTCTTATTTCCAGGTCAACAAAGGATTCAGATGATTGGGCTGAC 
TTGGATGACTTGGATTTCAGTCCATCCCTCAGCAGGATTGACCTGAAAAACAAGAAAAGA 
CAGAGTGATGACACTCTCTGCAGGTTTGAGAGTGTTTTGGACCTGAAGCCCTCTGAGCCT 
GTGGGCACAGGAAACAGTGCCCCCACCCAGACGTCATATCAGCGGCGAGACACGCCCACC 
CTGAGATCTGCAGCCAAGCAGCACTATTTGAAGCACTCTCGATACTTGCCTGGGATCAGT 
ATAAGAAATGGCATACTCTCGAATCCAGGCAAGGAATTTATTCCACCTAATCCATGGTCT 
AGTTCTGGCTTGTCTGGAAAATCTTCAGGGACAATGTCAGTAATCAGCAAAGTAAATTCA 
GTTGGTTCCAGCTCTACAAGTTCTAGTGGACTGACTGGAAACTATGTCCCTTCCTTTCTG 
AAAAAAGAAATCGGTTCTGCTATGCAGAGGGTACACCTAGCACCTATTCCAGACCCTTCC 
CCTGGTTATTCCTCCCTGAAGGCCATGAGACCTCATCCTGGGCGACCATTCTTGGACACC 
CAGCCTAGAAGCACTCCTGGGTTGATACCACGGCCTCCAGCCGCCCAGCCAGTGCATGGC 
CGGACAGACTGGGCTTCCAAGTACCCATCCCGGCGGTGA 
SEQ ID NO: 59_AA839940_M

AGCAGCAACAATGGTGGCATGAGTGCAGAGGAGGAGATAGGGCCTGGGGCTGAGCCTATG 

AGAGGACCAAGCTTGGCTACAAGGGACTGGAGAGATGAGACTGTTGGGACCACAGACCTG 

CAGCAAGGCATAGACCCAGGAGCAGTGAGCCCTGAGCCTGGGAAGGACCACGCAGCCC^^ 

GGCCCAGGAAGAACTGAAGCTGGAAGGGTATCTTCTGCTGCAGAGGCTGCCATTGTGGTT 

CTAGATGACAGCGCAGCACCCCCAGCCCCTTTTGAACACCGGGTAGTGAGCATCAAAGAT 

ACCCTGATCTCTIGCAGGCTACACGGTATCCCAACATGAAGTCTTAGGAGGGGGTCGGTTT 

GGCCAGGTGCACAGGTGTACAGAGAGGTCTACAGGCCTTGCACTGGCAGCCAAGATCATC 

AAAGTGAAGT^CGTAAAGGACCGGGAGGATGTGAAGAATGAGGTCAACATCATGAACCAG 

CT(::7i.GCCACGTAAACTTGATCCAACTTTATGATGCGTTTGAGAGCAAGAACAGCTT^^ 

CTGATCATGGAGTATGTGGATGGAGGCGAACTCTTTGACCGGATCACGGATGAGAAGTAC 

CACCTCACTGAGTTGGATGTGGTCTTGTTCACGAGGCAGATCTGTGAGGGTGTGCATTAC 

CTGCATCAGCACTATATCCTGCACCTGGACCTCAAGCCTGAGAACATATTGTGTGTCAGC 

C7LGACAGGGCATCAAATTAAGATCATTGACTTTGGGCTGGCTAGAAGATACAAGCCTCGG 

GAGAAGCTAAAGGTGAACTTTGGTACTCCGGAGTTCCTGGCCCCAGAAGTTGTTAACTAT 

GAGTTTGTGTCATTTCCAACAGACATGTGGAGTGTGGGAGTTATOVCCTACATGCTACTC 

AGTGGTTTGTCCCCATTTCTAGGGGAGACAGATGCAGAGACCATGAATTTTATTGTGAAC 

TGCAGCTGGGATTTCGATGCTGATACCTTCAAAGGGCTGTCGGAGGAAGCCAAGGACTTT 

GTTTCCCGGTTACTGGTCAAAGAGAAGAGCTGTAGGATGAGCGCCACACAGTGCCTGAAA 

CACGAGTGGTTAAATCACCTGCCTGCCAAAGCCTCGGGCTCCAACGTTCGCCTCAGATCC 

CAACAACTGCTGCAGAAATATATGGCTCAGAGTAAATGGAAGAAACATTTCCACGTGGTG 

GCTGCAGTCAACAGGCTACGGAAATTTCCAACGTGTCCCTAATCTTCAACTCTGGTGTTC 

CACTGGGCCTGGGAATTCTTGAGGCAACACGAAGTGGTAATATGAAGAGATTACTCAAGA 

TTTTATGTAGATTGGCGCTTTGCTATTATTGATTTTTCTTATTTTGCAAAGAATGATGGA 

AGGAAGCAAGAAAGAAAGAAAAGAAGAAAAGGGGGAAGAAAAGGAAAAGGCAGAAAGCAA 

GGAAACAGGCTACGTTGTTGCTCTTCTTGTAGGTGAAAGTGTTTTTATTAAAAGCCCTAG 

GAATGTTTTTCTGCCTCGTAAGGTCAGCAGGTCTCATATGCTGCTTGCTACCCCGCACCC 

TTCCTTTTGGTAATAAGAGCAGGCACGCTCAGGATGGGCAGGGAAATCCTACTTGGCTTT 

TGGTCAAATTTGAATTCTAAACTTGTCATGATTAAAGAAGCCAGTAGGGAGGGAGGTATG 

GAAGAGGGAGGAATTAGGTCCAACAGTGGGGGATGAATTTGACCGAAACATTGTATAAAA 

TTCTTAAAGAATTAATAAAATATATTTTTAAAGGAG 

SEQ ID NO: 60_AA460132_H 

GGAACCTCAGGCTTCAGAGAGCCGAAAAGTTGGGAGGCGTAACCACTTACAGGCCGGAAG 
TGTCCGGGGTGGACGCATTCGGGTAGCCGAAGAAGTCCCAGGATTGCCGAAGAAGTCCCA 
GGATTTCCGAAGCGAGCCGAAGCATCGCGACAGTTTTCAGAGACAGCTGATCGGTTGGAG 
CTGTTGCGCCGAGCAGTCATGGCGGCGGCCAGAGCTACTACGCCGGCCGATGGCGAGGAG 
CCCGCCCCGGAGGCTGAGGCTCTGGCCGCAGCCCGGGAGCGGAGCAGCCGCTTCTTGAGC 
GGCCTGGAGCTGGTGAAGCAGGGTGCCGAGGCGCGCGTGTTCCGTGGCCGCTTCCAGGGC 
CGCGCGGCGGTGATCAAGCACCGCTTCCCCAAGGGCTACCGGCACCCGGCGCTGGAGGCG 
CGGCTTGGCAGACGGCGGACGGTGCAGGAGGCCCGGGCGCTCCTCCGCTGTCGCCGCGCT 
GGAATATCTGCCCCAGTTGTCTTTTTTGTGGACTATGCTTCCAACTGCTTATATATGGAA 
GAAATTGAAGGCTCAGTGACTGTTCGAGATTATATTCT^GTCCACTATGGAGACTGAAAAA 
ACTCCCCAGGGTCTCTCCAACTTAGCCAAGACAATTGGGCT^GGTTTTGGCTCGAATGCAC 
GATGAAGACCTCATTCATGGTGATCTCACCACCTCCAACATGCTCCTGAAACCCCCCCTG 
GAACAGCTGAACATTGTGCTCATAGACTTTGGGCTGAGTTTCATTTCAGCACTTCCAGAG 
GATAAGGGAGTAGACCTCTATGTCCTGGAGAAGGCCTTCCTCAGTACCCATCCCAACACT 
GAAACTGTGTTTGAAGCCTTTCTGAAGAGCTACTCCACCTCCTCCAAAAAGGCCAGGCCA 
GTGCTAAAAAAATTAGATGAAGTGCGCCTGAGAGGAAGAAAGAGGTCCATGGTTGGGTAG 
AAGAATGTGTATGACAACCACACACAGTGAAGCTCTTTTTTCAAAGTAAATTTGAAGAAA 
TGCTACAAGTATGAGATGAGATCTAAGTAAAGGTGTTAAGATATTTTTAAGTGGTATGTG
ATCGTGTCATTATCATCTGCACTTCACTCAAGAGCTTACTATGTGTCTAAGTCATGTTCT 
AGGCAGT^TTGGGTATTTAAAGTAAATTGAGGACAGGCTTCTCCCAGATTGTGACATGTA 
TATCTCAGATACATGGGTGTGGCATTGAACCACATAATGAGAACATTATTCTCTTTTTAG 
TCCTTGTGAGACAAGGATGAAGTCTCAGTTGCTGATACTCGCTGAGCTTACTGGCCCTCT 
AACCCAGTGTTTTTTTTTGTTGTTGTTGTGTACATGTTATATTTATTTTGAAACCAGTTT 
AATGGGATACAACCAGCATTTTAAAAAATGAAATAGAATACAGCATGG 

SEQ ID NO: 61_SGK034_H 

CAGAGAGAGAAGGTAAACCAAGGGAACATGCCAGGGCTTCAGAGCACCTTCCTAGCCATG 
GACACGGAGGAGGGGGTAGAGGTGGTGTGGAACGAGCTCCACTTCGGAGACAGGAAGGCC 
TTCGCGGCGCACGAGGAGAAGATCCAGACCGTGTTCGAGCAGCTGGTGCTGGTGGACCAC 
CCGAACATCGTGAAGTTGCACAAGTACTGGCTGGATACCTCTGAGGCCTGCGCGAGGGTC 
ATCTTCATCACAGAGTACGTGTCT^TCAGGCAGCCTCAAGCAATTCCTCAAAAAGACCAAG 
AAGAACCACAAGGCCATGAACGCCCGGGCCTGGAAGCGCTGGTGCACGCAGATCCTGTCT 
GCGCTCAGCTTCCTGCACGCCTGCAGCCCCCCAATCATCCACGGGAACCTGACCAGCGAC 
ACCATCTTCATTCAGCACAACGGCCTCATCAAGATCGGCTCCGTGTGGCACCGAATCTTC 
TCCAATGCACTTCCAGATGATCTCCGAAGCCCCATCCGCGCTGAGCGAGAGGAACTTCGG 
AACCTGCACTTCTTCCCCCCAGAGTATGGAGAGGTGGCCGATGGGACCGCTGTGGACATC 
TTCTCCTTTGGGATGTGTGCGCTGGAGATGGCTGTACTGGA7VATCCAGACCAATGGGGAC 
ACCCGGGTCACAGAGGAGGCCATTGCTCGCGCCAGGCACTCGCTGAGTGACCCCAACATG 
CGGGAGTTCATCCTTTGCTGCCTGGCCCGGGACCCTGCCCGCCGGCCCTCTGCCCACAGC 
CTCCTCTTCCACCGCGTGCTCTTCGAGGTGCACTCGCTGAAGCTCCTGGCAGCCCACTGC 
TTCATCCAGCACCAGTACCTCATGCCTGAGAATGTGGTGGAGGAGAAGACCAAGGCCATG 
GACCTGCACGCGGTCTTGGCGGAGCTTCCCCGGCCCCGCAGGCCCCCGCTGCAGTGGCGG 
TACTCGGAAGTCTCCTTCATGGAGCTGGACAAATTCCTGGAGGATGTCAGGAATGGAATC 
TACCCACTGATGAACTTTGCAGCCACTCGACCCCTGGGGCTGCCCCGTGTGCTGGCCCCA 
CCCCCGGAGGAGGTCCAAAAGGCCAAGACCCCGACGCCAGAGCCCTTTGACTCTGAGACC 
AGAAAGGTCATCCAGATGCAGTGCAACCTGGAGAGAAGCGAGGACAAGGCGCGCTGGCAT 
CTCACTCTGCTTCTGGTGCTGGAAGACCGGCTGCACCGGCAGCTGACCTACGACCTGCTC 
CCAACGGACAGCGCCCAGGACCTCGCCTCGGAGCTCGTGCACTATGGCTTCCTCCACGAG 
GACGACCGGATGAAGCTGGCCGCCTTCCTGGAGAGCACCTTCCTCAAGTACCGTGGGACC 
CAGGCCTGACCCGGAGCCCCAGCCCCAGGGGACCATGCCGGGGTGCTGCCCGGGCAGGCC 
ATGTTGGGGAGACTCCAGCACCGTGGGGCTGCCCTCCTCCATGCGCCTGGGAGCACAAAG 
GCCCCGGTAGTGAAGGAACCCCCCGTCTCCTGAGAGTGGGGCTGACCCTGCCTTGGGCGC 
CGAGGGGTTGGGGGGTGGGTGTGGGGGAGCCGTTAGGCCTCCCAGGTCCTTAGGATCAGG 
GTTGCCCCCAGAACCCCTTCCCATATCCTCCATTCTCCGCCCTGAGTTCCTACCCAGGCT 
GCCTGGCTGGGGCCACTGCCTCCTCAGCATGCAGGAGGCTGCCCTGTAGGGAACCCCAGC 
TCTGGGGCTTGGGGGTGAGGGTCAGCCCTGGACAGACCTCTGCCCAGGGAACTGCTCCAT 
GGGGTCTGGGAGAGCAGCCATCCCGTGCTGGCACCATAGACCCACACAAGGAGCCTGCAC 
AGCAAGCCAGCGGTGACACACCTGCAGGTGTCAGGCATGGCACTGGGCACAACAGGGACC 
TGGCAGGAGAAACAGACCACAGAGAGGTCTGGAGTTGAGGCTGTTGTCAGCAAAGCCCCT 
GGTCCCACACAGCTCTGCCCTAGAGCCACCTCTTTGACCCTTTACCCACCCTGAGACCAG 
AACTTGCAGCCCCTCTGCAGATCTCCTCTGGCCACTGCAGCCCCTCCAATGGGCTTTTTC 
TCTCATGCATTCCCTGGCCTGGAGGCGTCAGGGACCCCACATCCTCCCTGCTCCTCAGAC 
TCACAGCCCCTCCATGTTACCTCCCGCACCTCCTCCCTGGGGCAGCTGCTCCCTGGGCCT 
CTGAGGATGTCAGCTCCTGGCTCCCTGCCTCTCTCCCACTCCACTCCTGGCTCAGTCTTA 
GAGATTTCTATGCCCTCATGGATTCTACCCCTGCCTTCCTGGCCTCTTGATTCTTGGCTT 
GCCTCTCCTCCAATTCCAAACTTAGTGAAATGGCCTTAAGCATTTTAAACTGTATGTATA 
CATTAGCGCATTCATGCCTTTCTAAACGCATTTCAAATGTCAACCAGGAAGGCACACCAC 
TGTATTAGTTTTATACTGCCGCTGTAAAATTTACCACAAACTTAGTGACTTAACACAAAT

TTATTGCAATTCTGTAGGCTGGAAGTCTGACTATGGGTCTCACTGGACTAGAATCAAGGC 

TGGCAGGCTGCCTTCCTTCCTGGAGGTTCTAGGGGAGACTCTGTCTCCTGCTCCTTCAGG 

CTGCTGGCAGAATCCACATCCTTTCGGTGGCAGGGCCAAGGTCCCCACTTTCTTGCTGAC 

TGTAAACTAAGGCCACTTCCAGCTTGTAGAGGCTGCCTACATTCCTTGGCTCTTGGCCCC 

CTCCTCCT^TCTTCAGAGCTAGCAGGTTCAGTCTGTGTCACGAACCATTTCTCTGGTTCCC 

TGCAGACAGGAAAGGTTGTCCCTAAGGACTCATGAGATTAGGTTGGGCCCAGCCAGATAA 

TACATGATAATCTCCCTCCTCAAGGTTTTTAATATTAAACACATCTGCAGGACACATTTT 

GCCATGTAAACTAACATTCACTGGTTCCAGGGATTAAGGAATGAACCTCTTTTGTTGGGG 

AAGGGTGGCATTCTGCTGACCACAGCACTCCAACCAAAAGCOUU^CC^^ 

TACTAACGCATATCAAATAAATTAAAGGTACAAAATCGTGAATCTCAGTTATCTTAAATA 

TTCCT^TACTATTTACAAAATTATTCAAATTCTCACGCCTTCCAACTCAAAATTAGCAA 

CTAAAGTAATTTCCATATCCTAGATGGA?^CCCTCATGCTAAACTGTCTGATTATGCATG 

GTTCTAAATGGTTTCAGTGGCAAATACATAACATTGTACTACTGATTAAACTGAACTTAA 

AAGC 

SEQ ID NO: 62_AA103218_M, SGK034_M

CCACGCGTCCGCACCAGAGTATGGCGAAGTCAATGATGGGACTGGCTTTGTGGACATCTT 

CTCCTTCGGGATGTGTGCACTGGAGATGGCTGTACTCGAGATCCAAGCCAACGGGGATAC 

CAGAGTCTVCTVGAAGAGGCCATCGCTCGAGCCAGGCACTCACTGAGTGACCCCTVACA^^ 

GGAATTCATCCTCTCCTGCCTGGCCCGGGACCCTGCCCGCCGACCCTCAGCCCACAACCT 

CCTCTTCCACCGAGTGCTCTTTGAGGTGCACTCGCTGAAGCTGCTGGCAGCTCACTGCTT 

CATCCAGCACCAGTACCTCATGCCTGAGAATGTGGTAGAGGAAAAGACCAAGGCCATGGA 

CCTCCATGCAGTTTTGGCTGAGATGCCGCAGCCCCATGGACCCCCAATGCAGTGGCGGTA 

CTCAGAGGTCTCCTTCTTGGAGCTGGACAAATTCCTAGAGGATGTCAGGAACGGGATCTA 

TCCACTGATGAACTTTGCGGCTGCTCGGCCCTTGGGGCTTCCCCGTGTGTTGGCCCCACC 

CCCAGAGGAAGCCCAAAAGGCCAAAACTCCAACGCCAGAACCCTTTGACTCGGAGACCAG 

GAAGGTGGTCCAGATGCAGTGCAACCTGGAAAGAAGCGAGGACAAGGCTCGGTGGCACCT 

TACTCTGCTCTTGGTGCTTGAGGACCGGCTACATCGGCAGCTGACCTATGATCTGCTCCC 

AACGGACAGTGCCCAGGACCTCGCTGCTGAACTAGTGCATTATGGCTTCCTGCACGAGGA 

TGACAGGACAAAGCTAGCAGCCTTTCTGGAGACCACTTTTCTCAAGTACCGAGGGACGCA 

AGCGTGACCTTCCCAGTCCTGACGGCCCAGCAGAGATACAGGGGCTCAGGGTTGTCCACT 

TGGCAAAGAGCCCCCACACTGCTCAAAGCTGCCTTCTGCCTGTGTTCCCTGGAACTGAAC 

ACAGGCCCTGCTAGTGAAGACACCCCCACCCCCCAGCTTTCTGCAGCAGTGTGGGACCCT 

GGGGTGGTGATGGAGCCCTGAGCCTGGACGAGAGTGGATACAGGTCAGTTAGGGGAACCG 

CTCCATCTGGTACTAGACAACAGCCATGCCTTCAGGTGGCATAGAAACCTAGGGAAGGAG 

CCTGAACTCAGGTGTCACAGTGCTGGGCATCAGGCAGACCAGACCTGACCTGATTGGAGA 

ACTGTAGACTAGATAGCTTGGAGTTGAACCCATGGCCAGGGAATTCCTTGGTCCTGCTCA 

GACCAGTCCTGATCCCTTGCAGACCTGCCTTGAGCCCTCTTTCTGATCTTCCACACTCTT 

GAGACCAGGACCTGTGTCCTCCCCAAAGCCCTTGGGAAGGATCTTTCTATTCATCATCCC 

TCTGGCCTAGGGGCTCAGGGGTCAGGCATCCTCCACATTCCCTCCCTGGGGAAGTTGTGT 

GTTTGAGTTGAGGATGTGGGTTCCTGGCTCCCTCTTTCTCCCCAGCCCAACTTGTCTCTT 

TCTTACTGGTTTCAAAGTCCTGATGAACGCTTCCCCTCAGAGCCACCCTGGTTTCCTTGG 

TTCTTGAACTGCCTCTCTCCCAACTTCAAACCAGGTCTTAAACGTTTTTTAAATGCATAT 

ATAAATGTAATGCAGTCACGGTCCTTTTTAAACACTTTGTGTATGAAACCAGGAAAGCTC 

ACTATTGTATTAGGAATAGTTCCACTITTGCTGCTGTTAACAGATATCATAAACCCAGTGG 

TTTGAGACGACACACACACACACACACTICACACACACACT^GAGAGAGAGAGA 

CATCAAGTGTGATCCAGGCTCTCACTAGATTAATACCCAGGCTAAGTTCCTTTCTGGAAG 

CTGGGACTTACCTCCTGCTCCTTCAAGCTATTGGCAGAACTCACTTCCCTGCAATGGTAA 

GGCAGAAATCCCTATTTTCTCAACAGCTGCCAACTAAGAACCCCTCTCAGCTTCTAGAGG 
CCACCAACTTTTCTTAGTTCTTCTTTCTCCCCCCTCAAGACCAGCAGCGTCAAGTTGAAT
CTTTGTCCTGGGCTAGCTGACTGGCTTGCCACTGCTGGGAAGAGTTGGGGCCTTTTGTGA 
GTAGGTTGGACCCACCAGGATAACCGAGGATGATCCCCTTCTCAGGGTCTATAGATGAAC 
CACACCTGCGCAGTTCCTTCTGCTGTCATCCTGGGCTTTGGTGCTTGGAGAACAGCCGTG 
GGCGGTGGGTGTTGTTACTGTGGTACCTACCATGCCATCTTAACCGAAACCAAGACCTAA 
AATAAAACAGATTTGTCATGGGACATCTAATAAATTAAATGAACTCTG 

SEQ ID NO: 63_NEK7_H, N34132_H 

CACGAATCCGAGCCCGCTCGCCTCTCTCCAGCGAACCGACCATGTCTGGCGGCGCCGCAG 
AGAAGCAGAGCAGCACTCCCGGTTCCCTGTTCCTCTCGCCGCCGGCTCCTGCCCCCAAGA 
ACGGCTCCAGCTCCGATTCCTCCGTGGGGGAGAAACTGGGAGCCGCGGCCGCCGACGCTG 
TGACCGGCAGGACCGAGGAGTACAGGCGCCGCCGCCACACTATGGACAAGGACAGCCGTG 
GGGCGGCCGCGACCACTACCACCACTGAGCACCGCTTCTTCCGCCGGAGCGTCATCTGCG 
ACTCCAATGCCACTGCACTGGAGCTTCCCGGCCTTCCTCTTTCCCTGCCCCAGCCCAGCA 
TCCCCGCGGCTGTCCCGCAGAGTGCTCCACCGGAGCCCCACCGGGAAGAGACCGTGACCG 
CCACCGCCACTTCCCAGGTAGCCCAGCAGCCTCCAGCCGCTGCCGCCCCTGGGGAACAGG 
CCGTCGCGGGCCCTGCCCCCTCGACTGTCCCCAGCAGTACCAGCAAAGACCGCCCAGTGT 
CCCAGCCTAGCCTTGTGGGGAGCAAAGAGGAGCCGCCGCCGGCGAGAAGTGGCAGCGGCG 
GCGGCAGCGCCAAGGAGCCACAGGAGGAACGGAGCCAGCAGCAGGATGATATCGAAGAGC 
TGGAGACCAAGGCCGTGGGAATGTCTAACGATGGCCGCTTTCTCAAGTTTGACATCGAAA 
TCGGCAGAGGCTCCTTTAAGACGGTCTACAAAGGTCTGGACACTGAAACCACCGTGGAAG 
TCGCCTGGTGTGAACTGCAGGATCGAAAATTAACAAAGTCTGAGAGGCAGAGATTTAAAG 
AAGAAGCTGAAATGTTAAAAGGTCTTCAGCATCCCAATATTGTTAGATTTTATGATTCCT 
GGGAATCCACAGTAAAAGGAAAGAAGTGCATTGTTTTGGTGACTGAACTTATGACGTCTG 
GAACACTTAAAACGTATCTGAAAAGGTTTAAAGTGATGAAGATCAAAGTTCTAAGAAGCT 
GGTGCCGTCAGATCCTTAAAGGTCTTCAGTTTCTTCATACTCGAACTCCACTTATCATTC 
ACCGCGATCTTAAATGTGACAACATCTTTATCACCGGCCCTACTGGCTCAGTCAAGATTG 
GAGACCTCGGTCTGGCAACCCTGAAGCGGGCTTCTTTTGCCAAGAGTGTGATAGGTACCC 
CAGAGTTCATGGCCCCTGAGATGTATGAGGAGAAATATGATGAATCCGTTGACGTTTATG 
CTTTTGGGATGTGCATGCTTGAGATGGCTACATCTGAATATCCTTACTCGGAGTGCCAAA 
ATGCTGCGCAGATCTACCGTCGCGTGACCAGTGGGGTGAAGCCAGCCAGTTTTGACAAAG 
TAGCAATTCCTGAAGTGAAGGAAATTATTGAAGGATGCATACGACAAAACAAAGATG 

GATATTCCATCT^GACCTTTTGAACCATGCCTTCTTCCAAGAGGAAACAGGAGTACGGG

TAGAATTAGCAGAAGAAGATGATGGAGAAAAAATAGCCATAAAATTATGGCTACGTATTG

AAGATATTAAGAAATTAAAGGGAAAATACAAAGATAATGAAGCTATTGAGTTTTGTTTTG

ATTTAGAGAGAGATGTCCCAGAAGATGTTGCACAAGAAATGGTAGAGTCTGGGTATGTCT

GTGAAGGTGATCACAAGACCATGGCTAAAGCTATCAAAGACAGAGTATCATTAATTAAGA

GGAAACGAGAGCAGCGGCAGTTGGTACGGGAGGAGCAAGAAAACAAAAAGCAGGAAGAGA

GCAGTCTCAAACAGCAGGTAGAACAATCCAGTGCTTCCCAGACAGGAATCAAGCAGCTCC

CTTCTGCTAGCACCGGCATACCTACTGCTTCTACCACTTCAGCTTCAGTTTCTACACAAG

TAGAACCTGAAGAACCTGAGGCT^GATCAACATCAACAACTACTIGTACCAGCAACCCA

TATCTGTGTTATCTGATGGGACGGTTGACAGTGGTCAGGGATCCTCTGTCTTCACAGAAT

CTCGAGTGAGCAGCCAACTIGACAGTTTCATATGGGTTCCCAANNCATGAACA^

CTACAGGCACAGTCCCAGGGCATATACCTTCTACTGTCCTUIGCACAGTCTC^ 

GGGTATATCCACCCTCAAGTGTGCAGCAGGGAATACAGCAGACAGCCCCTCCTCAACAGA 

CAGTGCAGTATTCACTTTCACT^GACATCAACCTCCAGTGAGGCCACTACTGCA^ 

TGAGTCAGCCTCAAGCTCCACAAGTCTTGCCTCAAGTATCAGCTGGAAAACAGAGTACTC 

AGGGAGTCTCTCAGGTTGCTCCTGCAGAGCGAGTTGCAGTAGCACTVGCCCCAAGCT^ 

AGCCGACCACTTTGGCTTCCTCTGTAGACAGTGCACATTCAGATGTTGCTTCAGGTATGA
GTGATGGCAATGAGAACGTCCCATCTTCCAGTGGAAGGCATGAAGGAAGAACTACAAAAC
GGCATTACCGAAAATCTGTAAGGAGTCGCTCTCGACATGAAAAAACTTCACGCCCAAAAT

TAAGAATTTTGAATGTTTCAAATAAAGGAGACCGAGTAGTAGAATGTCAATTAGAGACTC 

ATAATAGGAAAATGGTTACATTCAAATTTGACCTAGATGGTGACAACCCCGAGGAGATAG 

CAACAATTATGGTGAACAATGACTTTATTCTAGCAATAGAGAGAGAGTCGTTTGTGGATC 

AAGTGCGAGAAATTATTGAAAAAGCTGATGAAATGCTCAGTGAGGATGTCAGTGTGGAAC 

CAGAGGGTGATCAGGGATTGGAGAGTCTACAAGGAAAGGATGACTATGGCTTTTCAGGTT 

CTCAGAAATTGGAAGGAGAGTTCAAACAACCAATTCCTGCGTCTTCCATGCCACAGCAAA 

TAGGCATTCCTACCAGTTCTTTAACTCAAGTTGTTCATTCTGCGGGAAGGCGGTTTATAG 

TGAGTCCTGTGCCAGAAAGCCGATTACGAGAATCAAAAGTTTTCCCCAGTGAAATAACAG 

ATACAGTTGCTGCCTCTACAGCTCAGAGCCCTGGAATGAACTTGTCTCACrCTGCATCAT 

CCCTTAGTCTACAACAGGCCTTTTCTGAACTTAGACGTGCCCAAATGACAGAAGGACCCA 

ATACAGCACCTCCAAACTTTAGTCATACAGGACCAACATTTCCAGTAGTAC 

TAAGTAGCATTGCTGGAGTCCCAACCACAGCAGCAGCCT^CAGCACCAGTCCCTGCAAC^ 

GCAGCCCTCCTAATGACATTTCCACATCAGTAATTCAGTCTGAGGTTACAGTGCCCACTG 

AAGAGGGGATTGCTGGAGTTGCCACCAGCACAGGTGTGGTAACTTCAGGTGGTCTCCCCA 

TACCACCTGTGTCTGAATCACCT^GTACTTTCCAGCGTAGTTTCAAGTATCACAATAC 

CAGTTGTCTCAATATCTACTACATCCCCGTCACTTCTAGTCCCCACATCCACATCTG^^ 

TCGTTGTTTCTAGTACAGCACTGTATCCTTCAGTAACAGTTTCAGCAACTTCAGCCTCTG 

CTVGGGGGCAGTACTGCTACCCCAGGTCCTAAGCCTCCAGCTGTAGTATCTCAGCAGGC^ 

CAGGCAGCACTACTGTGGGAGCCTiiCATTAACATCAGTTTCTACCACCTVCT 

GCACAGCTTCACAGCTGTCCATTCAGCTTAGCAGCAGTACTTCTACTCCTACTTTAGCTG 

AAACCGTGGTAGTTAGCGC7VCACTCACTAGATAAGACATCTCATAGCAGTACAACTGGAT 

TGGCTTTCTCCCTCTCTGCACCATCTTCCTCTTCCTCTCCTGGAGCAGGAGTGTCTAGTT 

ATATTTCTCAGCCTGGTGGGCTGCATCCTTTGGTCATTCCATCAGTGATAGCTTCTACTC 

CTATTCTTCCCCAAGCAGCAGGACCTACTTCTACACCTTTATTACCCCAAGTACCTAGTA 

TCCCACCCTTGGTACAGCCTGTTGCCAATGTGCCTGCTGTACAGCAGACACTAATTCATA 

GTCAGCCTCT^CCAGCTTTGCTTCCCAACCAGCCCCATACTCATTGTCCTGAAGTAGATT 

CTGATACACAACCCATy^GCTCCTGGAATTGATGACATAAAGACTCTAGAAGAAAAGCTGC 

GGTCTCTGTTCAGTGAACACAGCTCATCTGGAGCTCAGCATGCCTCTGTCTCACTGGAGA 

CCTCACTAGTCATAGAGAGCACTGTCACACCAGGCATCCCAACTACTGCTGTTGCACCAA 

GCAAACTCCTGACTTCTACCACAAGTACTTGCTTACCACCAACCAATTTACCAC^ 

CAGTTGCTTTGCCAGTTACACCAGTGGTCACACCTGGGCAAGTTTCTACCCCAGTCAGCA 

CTACTACATCAGGAGTGAAACCTGGAACTGCTCCCTCCAAGCCACCTCTAACTAAGGCTC 

CGGTGCTGCCAGTGGGTACTGAACTTCCAGCAGGTACTCTACCCAGCGAGCAGCTGCCAC 

CTTTTCCAGGACCTTCTCTAACCCAGTCCCTVGCAACCTCTAGAGGATCTTGATGCTC^ 

TGAGAAGAACACTTAGTCCAGAGATGATCACAGTGACTTCTGCGGTTGGTCCTGTGTCCA 

TGGCGGCTCCAACAGCAATCACAGAAGCTIGGAACACAGCCTCAGAAGGGTGTTTCT^^ 

TCAAAGAAGGCCCTGTCCTAGCAACTAGTTCAGGAGCTGGTGCTTTTAAGATGGGACGAT 

TTCAGGTTTCTGTTGCAGCAGACGGTGCCCAGAAAGAGGGTAAAAATAAGTCAGAAGATG 

CAAAGTCTGTTCATTTTGAATCCAGCACCTCAGAGTCCTCAGTGCTATCAAGTAGTAGTC 

CAGAGAGTACCTTGGTGAAACCAGAGCCGAATGGCATAACCATCCCTGGTATCTCTTC7VG 

ATGTGCCAGAGAGTGCCCACAAAACTACTGCCTCAGAGGCAAAGTCAGACACTGGGCAGC 

CTACCAAGGTTGGACGTTTTCAGGTGACAACTACAGCAAACAAAGTGGGTCGTTTCTCTG 

TATCAAAAACTGAGGACAAGATCACTGACACT^GAAAGAAGGACCAGTGGCATCTCCTC 

CTTTTATGGATTTGGAACAAGCTGTTCTTCCTGCTGTGATACCAAAGAAAGAGAAGGCTG 

AACTGTCAGAGCCTTCACATCTAAATGGGCCGTCTTCTGACCCGGAGGCCGCTTTTTTAA 

GTAGGGATGTGGATGATGGTTCCGGTAGTCCACACTCGCCCCATCAGCTGAGCTCAAAGA 

GCCTTCCTAGCCAGAATCTAAGTCAAAGCCTTAGTAATTCATTTAACTCCTCTTACATGA 

GTAGCGACAATGAGTCAGATATCGAAGATGAAGACTTAAAGTTAGAGCTGCGACGACTAC 

GAGATAAACATCTCAAAGAGATTCAGGACCTGCAGAGTCGCCAGAAGCATGAAATTGAAT 
CTTTGTATACCAAACTGGGCAAGGTGCCCCCTGCTGTTATTATTCCCCCAGCTGCTCCCC

TTTCAGGGAGAAGACGACGACCCACTAAAAGCAAAGGCAGCAAATCTAGTCGAAGC^^ 

CCTTGGGGAATAAAAGCCCCCAGCTTTCAGGTAACCTGTCTGGTCAGAGTGCAGCTTCAG 

TCTTGCACCCCCAGCAGACCCTCCACCCTCCTGGCAACATCCCAGAGTCCGGGCAGAATC 

AGCTGTTACAGCCCCTTAAGCCATCTCCCTCCAGTGACAACCTCTATTCAGCCTTCACCA 

GTGATGGTGCCATTTCAGTACCAAGCCTTTCTGCTCCTIGGTCAAGGTAATAAAGCT^CCA 

TCATCGTCCAAAAACAATAAAATGGAGATGTTGCCATACCTGGGACAAAAGCCTGTTAAG 

GCGGGTTGGGAGACTAGCTGACCAGAACACAGCCTGTGTGTTGTACACTGAAGAATCTGG 

GTGAAAAGGGAAGTGGAGTGATAATGAGAATCGGTGGGCTCACTGCTCCCATTAGGTGAA 

ATTACTTTTTTTCAAGGAATTACT^GTGAAAAGTTACATCTGTGTGGCCTATATGACTTGC 

TCATTTGGGATTTGGAACTTAGGCTTTAATATTAGGCTGAGATTTCCTGGATGAAATTCT 

AAGGTGTTTTAGCAGTTTCTGAAGCTAATACATTTTCTTAGCCATTGTAGAATTTTGTTA 

CTTTTAAGTATGGGAGTGGCATACTAAAATGAATAACCTTACAATTCAGTTTTTTATCCA 

TAATCTACTTTCCAAATATAGCTCTGTTTATTAGTGATTGCTGAAAAAATTCCCACAGAG 

GAAAGAGCTTTTAGTCATATTAGAACAAGAATTGAAAAGACTTGGGCATCTGGGTGAGAA 

GAATGAAAAAAATATAGGTACTGGCTTATGTGCCTTTGCCACAGTTTC7VCAGAAATTAGA 

GATCAGTCTCTTCACAGGAAGAATGCACTTGATTGGTAAGGAGGGCAAACTAGCTAGCAT 

TATTCGAACTAAGAAAAGCTTCCGCATTTTGCAGATGGGTAGAATTAAGACCTAATATTT 

CATCTCTTACATATCTGACCTTCCCCCCAGAAGCTTGTTCTTCTGTGTGCCATCTTAGTG 

CATTTCACCACTCCAGCCTCAAGTTTCTAACATCTTGTAGTTGTGTTCTGTCTCTTCTCC 

TCTCTCTGTTCTACCCTGTTTTTCCCCTCTC7VC71GGCTGTGCGAAGTTTAACTGTGCATC 

TGAACAGGTGACATTCAAACCTGGTGGCAGGAGGACCCGATTTCTGAGTACGCCCTGCTT 

GGCTCTTTGTGTGTAACACCTTTACTCCTTCCTTGTCCTTGTGTTTCTGCTGCTTGGATC 

TGATGTTTCACGCAGTCCATTTTCATTTGTCTCTTTTTGTATATCATCTACTCAGTGGCT 

TGGCTGAATTACTGTTACCCTCAGAAGTTTGGGCCCCCACATTAATTATGATAAAAAATG 

TCTU^TAACT^GTTATCTACT^TTTCAATGTAACTTTCTGGTAGAAGTGCTTCTTCAT 

GGATCTGTGACT^GAGAGTGGATATGGTATCTAGGCAATAGATTGCTGGGTCATTTAGAAT 

AATGAAGACTGAACTCCACAGTCGTAGTCAGTGCTGTCTGTCTGCCCTAGCATTAGAAAT 

GAGAGAAATCAGCCAGACACGGTGGCGTACACCTGTAATCCCAGCACTTTGGGAGGCCGA 

GGCGGGAAGATTGCTTGAGGCCAGGAGCTCGAGACCAACCCTGGGCAACATGGTGATACC 

CCATCTCT 

SEQ ID NO: 64_BCGN3_H 

GCGGAGCGCAGCTGTGAGGGAGTCGCTGTGATCCGGGGCCCCGGAACCCGAGCTGGAGCT 

GAAGCGCAGGCTGCGGGGCGCGGAGTCGGGAGGCCTGAGTGTTCCTTCCAGCATGTCGGA 

GGGGGAGTCCCAGACAGTACTTAGCAGTGGCTCAGACCCAAAGGTAGAATCCTCATCTTC 

AGCTCCTGGCCTGACATCAGTGTCACCTCCTGTGACCTCCACAACCTCAGCTGCTTCCCC 

AGAGGAAGAAGAAGAAAGTGAAGATGAGTCTGAGATTTTGGAAGAGTCGCCCTGTGGGCG 

CTGGCAGAAGAGGCGAGAAGAGGTGAATCAACGGAATGTACCAGGTATTGACAGTGC^ 

CCTGGCCATGGATACAGAGGAAGGTGTAGAGGTTGTGTGGAATGAGGTACAGTTCTCTGA 

ACGCAAGAACTACAAGCTGCAGGAGGAAAAGGTTCGTGCTGTGTTTGATAATCTGATTCA 

ATTGGAGCATCTTAACATTGTTAAGTTTCT^CAAATATTGGGCTGACTlTTAAAGAGAACi^ 

GGCCAGGGTCATTTTTATCACAGAATACATGTCATCTGGGAGTCTGAAGC^ 

GAAGACCAAAAAGAACCACAAGACGATGAATGAAAAGGCATGGAAGCGTTGGTG<^ 

AATCCTCTCTGCCCTAAGCTACCTGCACTCCTGTGACCCCCCCATCATCCATGGGAACCT 

GACCTGTGACACCATCTTCATCCAGCACAACGGACTCATCAAGATTGGCTCTGTGGCTCC 

TGACACTATCAACAATCATGTGAAGACTTGTCGAGAAGAGCAGAAGAATCTACACTTCTT 

TGCACCAGAGTATGGAGAAGTCACTAATGTGACAACAGCAGTGGACATCTACTCCTTTGG 

CATGTGTGCACTGGAGATGGCAGTGCTGGAGATTCAGGGCAATGGAGAGTCCTCATATGT 

GCCACAGGAAGCCATCAGCAGTGCCATCCAGCTTCTAGAAGACCCATTACAGAGGGAGTT 
CATTC7UUUVGTGCCTGCAGTCTGAGCCTGCTCGCAGACCAACAGCCAGAGAACTTCTGTT
CCACCCAGCATTGTTTGAAGTGCCCTCGCTCAAACTCCTTGCGGCCCACTGCATTGTGGG 
ACACCAACACATGATCCCAGAGAACGCTCTAGAGGAGATCACCAAAAACATGGATACTAG 
TGCCGTACTGGCTGAAATCCCTGCAGGACCAGGAAGAGAACCAGTTCAGACTTTGTACTC 
TCAGTCACCAGCTCTGGAATTAGATAAATTCCTTGAAGATGTCAGGAATGGGATCTATCC 
TCTGACAGCCTTTGGGCTGCCTCGGCCCCAGCAGCCACAGCAGGAGGAGGTGACATCACC 
TGTCGTGCCCCCCTCTGTCAAGACTCCGACACCTGAACCAGCTGAGGTGGAGACTCGCAA 
GGTGGTGCTGATGCAGTGCAACATTGAGTCGGTGGAGGAGGGAGTCAAACACCACCTGAC 
ACTTCTGCTGAAGTTGGAGGACAAACTGAACCGGCACCTGAGCTGTGACCTGATGCCAAA 
TGAGAATATCCCCGAGTTGGCGGCTGAGCTGGTGCAGCTGGGCTTCATTAGTGAGGCTGA 
CCAGAGCCGGTTGACTTCTCTGCTAGAAGAGACCTTGAACAAGTTCAATTTTGCCAGGAA 
CAGTACCCTCAACTCAGCCGCTGTCACCGTCTCCTCTTAGAGCTCACTCGGGCeAGGCCC 
TGATCTGCGCTGTGGCTGTCCCTGGACGTGCTGCAGCCCTCCTGTCCCTTCCCCCCAGTC 
AGTATTACCCTGTGAAGCCCCTTCCCTCCTTTATTATTCAGGAGGGCTGGGGGGGCTCCC 
TGGTTCTGAGCATCATCCTTTCCCCTCCCCTCTCTTCCTCCCCTCTGCACTTTGTTTACT 
TGTTTTGCACAGACGTGGGCCTGGGCCTTCTCAGCAGCCGCCTTCTAGTTGGGGGCTAGT 
CGCTGATCTGCCGGCTCCCGCCCAGCCTGTGTGGAAAGGAGGCCCACGGGCACTAGGGGA 
GCCGAATTCTACAATCCCGCTGGGGCGGCCGGGGCGGGAGAGAAAGGTGGTGCTGCAGTG 
GTGGCCCTGGGGGGCCATTCGATTCGCCTCAGTTGCTGCTGTAATAAAAGTCTACTTTTT 

GCT 

SEQ ID NO: 65_AA711829_M 

CTTAAGCAGTTTCTGAAGAAGACCAAAAAGAACCACAAGACTATGAATGAAAAGGCTTGG 

AAACGCTGGTGTACACAGATCCTCTCTGCCCTAAGCTACCTGCACTCCTGTGACCCTCCC 

ATCATCCATGGGAACCTGACCTGTGACACCATCTTCATCCAGCACAACGGACTCATCAAG 

ATTGGCTCTGTGGCTCCTGACACTATCAACAATCACGTGAAGACTTGCCGGGAAGAACAG 

AAGAACCTACACTTTTTTGCACCAGAGTATGGAGAAGTCACAAACGT^ 

GACATCTACTCCTTTGGCATGTGTGCACTGGAGATGGCAGTGCTGGAGATTCAGGGCAAT 

GGCGAGTCCTCATATGTGCCACAGGAAGCCATCAGCAGTGCCATCCAGCTACTAGAAGA 

TCT^TTACAGAGGGAGTTTATTCAAAAGTGCCTGCAGTCTGAGCCTGCTCGGAGACCAAC^ 

GCCAGAGAACTTCTGTTCCACCCAGCACTGTTTGAAGTGCCCTCACTCAAGCTTCTTGCT 

GCTCACTGTATCGTGGGGCACCAACACATGATCCCAGAGAACGCTCTAGAGGAGATCACC 

AAGAACATGGATACCAGTGCTGTACTAGCTGAAATTCCCGCAGGGCCAGGACGAGAACCA 

GTTCAGACTTTGTACTCTCAGTCACCAGCCCTAGAATTAGACAAATTCCTTGAAGATGTC 

AGGAATGGGATCTACCCTCTGACAGCCTTTGGGCTACCTCGGCCTCAGCAGCCACAGCAG 

GAGGAGGTGACATCACCTGTTGTGCCCCCCTCTGTCAAGACTCCAACTCCTGAGCCAGCT 

GAAGTGGAGACACGAAAGGTGGTGCTGATGCAGTGCAACATCGAATCTGTGGAGGAGGGA 

GTCAAACACCATCTAACACTTCTGCTGAAGCTGGAGGACAAATTGAACCGGCACCTGAGC 

TGTGACCTGATGCCAAATGAGAGCATCCCGGACTTGGCAGCTGAGCTGGTGCAGCTGGGC 

TTCATTAGTGAGGCTGATCAGAGCCGCCTGACTTCTCTGCTGGAGGAGACGCTCAACAAG 

TTCAACTTCACCAGGAACAGTACACTCAACACAGCCACTGTCACCGTCTCCTCGTAGAGC 

TCACTTGAGCCAGGCCCCTAGCCAGGCTGTGGCTGTCCCTGGGCATGCTGCAGTCCTCCT 

GTCCCTTCTCCCCAGTCAGTATTACCCTTCGCGCCCATATTATTTAGGAGGGCTTTAGGG 

GCTCCCTGGTTGAGTATCACCCTGCCCCTTCCCCTCTCTTCCTCCCCTCTGCACTTTGTT 

TACTTGTTTTGCACAGACGTGGGCCTGGGCCTTCTCAGCAGCCACCTTCTAGCTGGGGGC 

TAGTAGCTGACCTGCTGCCTCCTGCCCTACTTGTGTGGACAGGAGGCCCACGGGCACTGG 

GGAAGCTGAGTTCTACAATCCCGCTGGGGCGCATGGGCAGGAGAGAAAGGTGGTGCTGCA 

GGGGTGGCCCCCCGGGGGGGGCATTCGAATCACCTCAGTTGCTGCTGTAATAAAGTCTAC 

TTTTTGCT 
FIGURE 2BBB

SEQ ID NO: 66_AA099102_H 

ATGTCATCATGTGTCTCTAGCCAGCCCAGCAGCAACCGGGCCGCCCCCCAGGATGAGCTG 

GGGGGCAGGGGCAGCAGCAGCAGCGAAAGCCAGAAGCCCTGTGAGGCCCTGCGGGGCCTC 

TCATCCTTGAGCATCCACCTGGGCATGGAGTCCTTCATTGTGGTCACCGAGTGTGAGCCG 

GGCTGTGCTGTGGACCTCGGCTTGGCGCGGGACCGGCCCCTGGAGGCCGATGGCCAAGAG 

GTCCCCCTTGACACCTCCGGGTCCCAGGCCCGGCCCCACCTCTCCGGTCGCAAGCTGTCT 

CTGCT^GAGCGGTCCCAGGGTGGGCTGGCAGCCGGTGGCAGCCTGGACATGAACGGACGC 

TGCATCTGCCCGTCCCTGCCCTACTCACCCGTCAGCTCCCCGCAGTCCTCGCCTCGGCTG 

CCCCGGCGGCCGACAGTGGAGTCTCACCACGTCTCCATCACGGGTATGCAGGACTGTGTG 

CAGCTGAATC7VGTATACCCTGAAGGATGAAATTGGAAAGGGCTCCTATGGTGTCGTCAAG 

TTGGCCTACAATGAAi\ATGACAATACCTACTATGCAATGAAGGTGCTGTCCAAAAAGAAG 

CTGATCCGGCAGGCCGCTTTTCCACGTCGCCCTCCACCCCGAGGCACCCGGCCAGCTCCT 

GGAGGCTGCATCCAGCCCAGGGGCCCCATTGAGCAGGTGTACCAGGAAATTGCCATCCTC 

AAGAAGCTGGACCACCCCAATGTGGTGAAGCTGGTGGAGGTCCTGGATGACCCCAATGAG 

GACCATCTGTACATGGTGTTCGAACTGGTCAACCTU^GGGCCCGTGATGGAAGTGCCCACC 

CTCAAACCACTCTCTGAAGACCAGGCCCGTTTCTACTTCCAGGATCTGATCAAAGGCATC 

GAGTACTTACACTACCAGAAGATCATCCACCGTGACATCT^CCTTCCAACCTCCTGGTC 

GGAGAAGATGGGCACATCAAGATCGCTGACTTTGGTGTGAGCAATGAATTCAAGGGCAGT 

GACGCGCTCCTCTCCAACTACGTGGGCACGCCCGCCTTCATGGCTCCCGAGTCGCTCTCT 

GAGACCCGCTVAGATCTTCTCTGGGAAGGCO^AGGATGTTTGGGCCATGGGTGTGACACT^ 

TACTGCTTTGTCTTTGGCCAGTGCCCATTCATGGACGAGCGGATCATGTGTTTACACAGT 

AAGATCAAGAGTCAGGCCCT(^AATTTCCAGACCAGCCCGA<^TAGCTCAGGACTTG^ 

GACCTGATCACCCGTATGCTGGACAAGAACCCCGAGTCGAGGATCGTGGTGCCGGAAATC 

AAGCTGCACCCCTGGGTCACGAGGCATGGGGCGGAGCCGTTGCCGTCGGAGGATGAGAAC 

TGCACGCTGGTCGAAGTGACTGAAGAGGAGGTCGAGAACTCAGTCAAACACATTCCCAGC 

TTGGCAACCGTGATCCTGGTGAAGACCATGATACGTAAACGCTCCTTTGGGAACCCATTC 

GAGGGCAGCCGGCGGGAGGAACGCTCACTGTCAGCGCCTGGAAACTTGCTCACCAAAAAA 

CCAACCAGGGAATGTGAGTCCCTGTCTGAGCTCAAGGAAGCAAGGCAGCGAAGACAACCT 

CCAGGGCACCGACCCGCCCCCCGTGGGGGAGGAGGAAGTGCTCTTGTGAGAGGCAGTCCC 

TGCGTGGAAAGTTGCTGGGCCCCCGCCCCCGGCTCCCCCGCACGCATGCATCCACTGCGG 

CCGGAGGAGGCCATGGAGCCCGAGTAG 

SEQ ID NO: 67_5R69_17_2_H 

CCGGGATGTGAGCCTGGTGGTTGGCAGCTGGAGCCACGTCGGAGGGGGAAGTGTCGCAGC 

ATTCTCTGCAGGCATCACAGACCTGAGGCAGTGGCCTCCGGAGGGCACTGGACAGAAACA 

GCCATCCAAGTGGCTGAGTGGAGGGACCCTGCTCAAGTGCAGCTGCT^GTGGCCGGGG^ 

CCCTCAGGTAGGGATCGGGGCGCCTTGTCGCCGCCAGCCACGTGTGGCGTCCGGTACAGT 

CAGCAGAGTGCAGGGTGCGGGCACCAGGAAAGGGGGCGCAGGGGAACTCCCGCGGGCC 

GCGTTTGCAAACTTCTCGCCTGGGCAGGAGGCGGTCGTGGGAAAGAAGGTGGAAGAG 

GCTTTTTGGAACTGTGCACGGGACAGATTGGACGCACACCCCTCGGGAGGCGCGAAGGC^ 

TGGAAAATTTGAAGCATATTATCACCCTTGGCCAGGTCATCCACAAACGGTGTGAAGAGA 

TGAAATACTGCAAGAAACAGTGCCGGCGCCTGGGCC7VCCGCGTCCTCGGCCTGATCAAGC 

CTCTGGAGATGCTCCAGGACC7VAGGAAAGAGGAGCGTGCCCTCTGAGAAGTTAACCACAG 

CCATGAACCGCTTCAAGGCTGCCCTGGAGGAGGCTAATGGGGAGATAGi\AAAGTTCAGCA 

ATAGATCCAATATCTGCAGGTTTCTAACyiGCAAGCCAGGACAAAATACTCTTCAAGG^ 

TGAACAGGAAGCTGAGTGATGTCTGGAAGGAGCTCTCGCTGTTACTTCAGGTTGAGCAAC 

GCATGCCTGTTTCACCCATAAGCCAAGGAGCGTCCTGGGCACAGGAAGATCAGCAGGATG 

CAGACGAAGACAGGCGAGCTTTCCAGATGCTAAGAAGAGATAATGT^AAAAATAGAAGCTT 

CACTGAGACGATTAGAAATCAACATGAAAGAAATCAAGGAAACTTTGAGGCAGTGTAAGT 

TATCATGTGCCCTGCTGTTTCTGATGGCCCCCAAACTAGAAGTCATCAGTTTACTGGGAC 
CCCAGCCTCCCGCTACCCCTGCATTTGTCCATTTTCTGTGCTGGATGGCTGGAAGCAGCC

CACAGGTTTGGGGATCCATTCATGGCTAGCCCAGGCTTCTGTCCATGGAATAACATGTGG 

AGAGAGCTTCTTGACCAGTAAGATACCTTCTAGCAGCTGTCAAAGTACTTAAAAACCTCT 

ATGAATAGAATCAAAGCTTCAGTTCT^GTTGCTGAATTTCCAAGAAGAAATTCAAATCA^ 

TTTAAAATGCCCACTCT^TTCATTCATTCAACAAAACTGTGAGTATCTGGTTTATGCCAGA 

GGCCATGCAAAGAGGTAACTAAGATGCAGAGAAGGACACTGCCTTCCAGGAGCTCACGGG 

GTGGAGGAGGAAAGAGGAAAGACAGACAGTGAACACACAACAGCAAGGTTACTGAGCTTG 

AACTATGTCCCTAACTACTAGATCTGAAATGACTACGCCAGATGCCAGATGCTCAAGTGC 

CAAGCTCTGGGTAACAGGAATAGACATCCTTCCAGGATGAGAGAGATGAGTCTGGATGAG 

GGTTAAGGCTGGAGGGACAGGCGGGATTTGAAGAGGAGGGAAAGGAAGTGGATGACACAT 

TCTGTTAACTGTCCAGCTGTGTCTCTACTGGTCACTCAGAGGCACGGGAGCCGCTCCCTT 

QGGCTGAGTCCATCAGAAGCCCCAGCCACCACCAGCTCTGGTTCATGTAGTAGAGCTTCC 

CACTCACACATCAGAAATATGCCACCTCCCTTAGGACCCCTTCCTCTGCTCATTGACTCT 

TTTGTCTTCTTTCCTCTCGGGGGTGAGGTCAGATTTACCACCAAAATGCATGCAGGAGAT 

CCCGCAAGAGCAAATCAAGGAGATCAAGAAGGAGCAGCTTTCAGGATCCCCGTGGATTCT

GCTAAGGGAAAATGAAGTCAGCACACTTTATAAAGGAGAATACCACAGAGCTCCAGTGGC 

CATAAAAGTATTCAAAAAACTCCAGGCTGGCT^GCATTGCAATAGTGAGGCAGACTTTC^ 

TAAGGAGATCAAAACCATGAAGAAATTCGAATCTCCCAACATCCTGCGTATATTTGGGAT 

TTGCATTGATGAAACAGTGACTCCGCCTCAATTCTCCATTGTCATGGAGTACTGTGAACT 

CGGGACCCTGAGGGAGCTGTTGGATAGGGAAAAAGACCTCACACTTGGCAAGCGCATGGT 

CCTAGTCCTGGGGGCAGCCCGAGGCCTATACCGGCTACACCATTCAGAAGCACCTGAACT 

CCACGGAAAAATCAGAAGCTCAAACTTCCTGGTAACTCAAGGCTACCT^GTGAAGCTTGC 

AGGATTTGAGTTGAGGAAAACACAGACTTCCATGAGTTTGGGAACTACGAGAGAAAAGAC 

AGACAGAGTCAAATCTACAGCATATCTCTCACCTCAGGAACTGGAAGATGTATTTTATCA 

ATATGATGTAAAGTCTGAAATATACAGCTTTGGAATCGTCCTCTGGGAAATCGCCACTGG 

AGATATCCCGTTTCAAGGTGAAGAATGTGAAGACTGGCTCAGCCAGTGGCTGTAATTCTG 

AGAAGATCCGCAAGCTGGTGGCTGTGAAGCGGCAGCAGGAGCCACTGGGTGAAGACTGCC 

CTTCAGAGCTGCGGGAGATCATTGATGAGTGCCGGGCCCATGATCCCTCTGTGCGGCCCT 

CTGTGGATGAAATCTTAAAGAAACTCTCCACCTTTTCTAAGTAGTGTATCAAAATCTAAA 

CCAAGGAGTCTCTGGACAAGAAGCTGGGAGAGGCACGAACTGGACATCTCTCTCTCTCAT 

ATCCTTCGGCATTGGGTTATCTATGGGTGCAAGGAGTGGGCACGCTTCTCTGTTACAAAT 

AGAAAACGATTCCAGTCATACAGGACACATCCCACTCCAAATGATATTTCCAAAAACATA 

CCTCTGACAGTAACTTTGATAGATGGTTTGTCAAATGTATCTTTCTGGGTATCCACACCT 

CTTGGCAATGAAATTTGCAGCTCCTCCCTTCCATAAATGAAGTCTCTTTCCCCACCATTT 

GAATCTGGGCTGGCACTGTGACTTGATTTGATCAATAGAATGTGGAAGAAGTGACTGTAT 

GCCAGTTCCAAGCCTAGGTTTCAAGAGGCCTTATAAATGTCTGTTGGAACCTTACCCAGC 

CATGGACATGTTGAGTGAGCATGCTGGAGAATGAGAGACCACATGAAGCAGAAACATGCT 

TTCCTAGCTGAAGTCATACTAGCCCAACCAACATGGCAGCTAACACATGAAT^ 

TCAAGACCAGAAGAACCACTCAAGCAGATCCCAGCCCAAATTGCCCATTCACACAATCAG 

GAGCTAAATAAATTACTGTTGTCTTTT 

SEQ ID NO: 68_H85811_H 

GGCCCGGCCCCCTCCCCCGGCGCCGGCCACGGGAGGCGGTGATGCGGGCGCGGGCGGCCT 

CGGCTGCGCCGAGAGCGGAGACACAGGCTCAAGATGGCAGATTCCGACTGAGGCTGGGGG 

GGCCGAGCTCGCGCGCCGCTTTCCCGTCCCCGTTGCCATGAACCGCGGACACCCCGGCCC 

CGATGGCCCCCGTGTACGAAGGTATGGCCTCACATGTGCAAGTTTTCTCCCCTCACACCC 

TTCAATCAAGTGCCTTCTGTAGTGTGAAGAAACTGAAAATAGAGCCGAGTTCCAACTGGG 

ACATGACTGGGTACGGCTCCCACAGCAAAGTGTATAGCCAGAGCAAGAACATCCCCCTGT 

CGCAGCCAGCCT^CCACAACCGTCAGCACCTCCTTGCCGGTCCCAAACCCAAGCCT^^ 

ACGAGCAGACCATCGTCTTCCCAGGAAGCACCGGGCACATCGTGGTCACCTCAGCAAGCA 
GCT^CTTCTGTCACCGGGCAAGTCCTCGGCGGACCACTVCAACCTAATGCGTCGAAGCACTG

TGAGCCTCCTTGATACCTACCAAAAATGTGGACTCAAGCGTAAGAGCGAGGAGATCGAGA 

ACACAAGCy^GCGTGCy^GATCATCGAGGAGCATCCACCCATGATTCAGAATAATGCAAGCG 

GGGCCACTGTCGCCACTGCCACCACGTCTACTGCCACCTCCAAAAACAGCGGCTCCAACA 

GCGAGGGCGACTATCAGCTGGTGCAGCATGAGGTACTGTGCTCCATGACCAACACCTACG 

AGGTCTTAGAGTTCTTGGGCCGAGGGACGTTTGGGCAAGTGGTCAAGTGCTGGAAACGGG 

GCACCAATGAGATCGTAGCCATCAAGATCCTGAAGAACCACCCATCCTATGCCCGACAAG 

GTCAGATTGAAGTGAGCATCCTGGCCCGGTTGAGCACGGAGAGTGCCGATGACTATAACT 

TCGTCCGGGCCTACGAATGCTTCCAGCACAAGAACCACACGTGCTTGGTCTTCGAGATGT 

TGGAGCAGAACCTCTATGACTTTCTGAAGCAAAACAAGTTTAGCCCCTTGCCCCTCAAAT 

ACATTCGCCCAGTTCTCCAGCAGGTAGCCACAGCCCTGATGAAACTCAAAAGCCTAGGTC 

TTATCCACGCTGACCTCAAACCAGAGAACATCT^TGCTGGTGGATCCTVTCTAGACAACC^ 

ACAGAGTCAAGGTCATCGACTTTGGTTCAGCCAGCCACGTCTGCAAGGCTGTGTGC^ 

CCTACTTGCAGTCCAGATATTACAGGGCCCCTGAGATCATCCTTGGTTTACCATTTTGTG 

AGGCAATTGACATGTGGTCCCTGGGCTGTGTTATTGCAGAATTGTTCCTGGGTTGGCCGT 

TATATCCAGGAGATTCGGAGTATGATCAGATTCGGTATATTTCACAAACACAGGGTTTGC 

CTGCTGAATATTTATTAAGCGCCGGGACAAAGACAACTAGGTTTTTCAACCGTGACACGG 

ACTCACCATATCCTTTGTGGAGACTGAAGACACCAGATGACCATGAAGCAGAGACAGGGA 

TTAAGTCAAAAGAAGCAAGAAAGTACATTTTCAACTGTTTAGATGATATGGCCCAGGTGA 

ACATGACGACAGATTTGGAAGGGAGCGACATGTTGGTAGAAAAGGCTGACCGGCGGGAGT 

TCATTGACCTGTTGAAGAAGATGCTGACCATTGATGCTGACAAGAGAATCACTCCAATCG 

AAACCCTGAACCATCCCTTTGTCACCATGACACACTTACTCGATTTTCCCCACAGCACAC 

ACGTCAAATCATGTTTCCAGAACATGGAGATCTGCAAGCGTCGGGTGAATATGTATGACA 

CGGTGAACCAGAGCAAAACCCCTTTCATCACGCACGTGGCCCCCAGCACGTCCACCAACC 

TGACCATGACCTTTAACAACCAGCTGACCACTGTCCACAACCAGCCCTCAGCGGCATCCA 

TGGCTGCAGTGGCCCAGCGGAGCATGCCCCTGCAGACAGGAACAGCCCAGATTTGTGCCC 

GGCCTGACCCGTTCCAGCAAGCTCTCATCGTGTGTCCCCCCGGCTTCCAAGGCTTGCAGG 

CCTCTCCCTCTAAGCACGCTGGCTACTCGGTGCGAATGGAAAATGCAGTTCCCATCGTCA 

CTCAAGCCCCAGGAGCTCAGCCTCTTCAGATCCAACCAGGTCTGCTTGCCCAGCAGGCTT 

GGCCAAGTGGGACCCAGCAGATCCTGCTTCCCCCAGCATGGCAGCAACTGACTGGAGTGG 

CCACCCACACCTCAGTGCAGCATGCCACCGTGATTCCCGAGACCATGGCAGGCACCCAGC 

AGCTGGCGGACTGGAGAAATACGCATGCTCACGGAAGCCATTATAATCCCATCATGCAGC 

AGCCTGCACTATTGACCGGTCATGTGACCCTTCCAGCAGCACAGCCCTTAAATGTGGGTG 

TGGCCCACGTGATGCGGCAGCAGCCAACCAGCACCACCTCCTCCCGGAAGAGTAAGCAGC 

ACCAGTCATCTGTGAGAAATGTCTCCACCTGTGAGGTGTCCTCCTCTCAGGCCATCAGCT 

CCCCACAGCGATCCAAGCGTGTCAAGGAGAACACACCTCCCCGCTGTGCCATGGTGCACA 

GTAGCCCGGCCTGCAGCACCTCGGTCACCTGTGGGTGGGGCGACGTGGCCTCCAGCACCA 

CCCGGGAACGGCAGCGGCAGACAATTGTCATTCCCGACACTCCCAGCCCCACGGTCAGCG

TCATCACCATCAGCAGTGACACGGACGAGGAGGAGGAACAGAAACACGCCCCCACCAGCA 

CTGTCTCCAAGCAAAGAAAAAACGTCATCAGCTGTGTCACAGTCCACGACTCCCCCTACT 

CCGACTCCTCCAGCAACACCAGCCCCTACTCCGTGCAGCAGCGTGCTGGGCACAACAATG 

CCAATGCCTTTGACACCAAGGGGAGCCTGGAGAATCACTGCACGGGGAACCCCCGAACCA 

TCATCGTGCCACCCCTGAAAACCCAGGCCAGCGAAGTATTGGTGGAGTGTGATAGCCTGG 

TGCCAGTCAACACCAGTCACCACTCGTCCTCCTACAAGTCCAAGTCCTCCAGCAACGTGA 

CCTCCACCAGCGGTCACTCTTCAGGGAGCTCATCTGGAGCCATCACCTACCGGCAGCAGC 

GGCCGGGCCCCCACTTCCAGCAGCAGCAGCCACTCAATCTCAGCCAGGCTCAGCAGCACA 

TCACCACGGACCGCACTGGGAGCCACCGAAGGCAGCAGGCCTACATCACTCCCACCATGG 

CCCAGGCTCCGTACTCCTTCCCGCACAACAGCCCCAGCCACGGCACTGTGCACCCGCATC 

TGGCTGCAGCCGCTGCCGCTGCCCACCTCCCCACCCAGCCCCACCTCTACACCTACACTG 

CGCCGGCGGCCCTGGGCTCCACCGGCACCGTGGCCCACCTGGTGGCCTCGCAAGGCTCTG 
CGCGCCACACCGTGCAGCACACTGCCTACCCAGCCAGCATCGTCCACCAGGTCCCCGTGA
GCATGGGCCCCCGGGTCCTGCCCTCGCCCACCATCCACCCGAGTCAGTATCCAGCCCAAT 
TTGCCCACCAGACCTACATCAGCGCCTCGCCAGCCTCCACCGTCTACACTGGATACCCAC 
TGAGCCCCGCCAAGGTCAACCAGTACCCTTACATATAAACACTGGAGGGGAGGGAGGGAG 
GGAGGGAGGGAGAGAATGGCCCGAGGGAGGAGGGAGAGAAGGAGGGAGGCGCTCCTGGGA 
CCGTGGGCGCTGGCCTTTTATACTGAAGATGCCGCACACAAACAATGCAAACGGGGCAGG 
GGCGGGGGGGGGGGGGCAGAGGGCAGGGGGACGGGTCGGGACACCAGTGAAACTTGAACC 
GGGAAGTGGGAGGACGTAGAGCAGAGAAGAGAACATTTTTAAAAGGAAGGGATTAAAGAG 
GGTGGGAAATCTATGGTTTTTATTTTAAAAAAG 

SEQ ID NO: 69_DYRK3_H 

CGGGAGCGAAAGTGCGCTGAGCTGCAGTGTCTGGTCGAGAGTACCCGTGGGAGCGTCGCG 

CCGCGGAGGCAGCCGTCCCGGCGTAGGTGGCGTGGCCGACCGGACCCCCAACTGGCGCCT 

CTCCCCGAGCGGGGTCCCGAGCTAGGAGATGGGAGGCACAGCTCGTGGGCCTGGGCGGAA 

GGATGCGGGGCCGCCTGGGGCCGGGCTCCCGCCCCAGCAGCGGAGTTGGGGGATGGTGTC 

TATGACACCTTCATGATGATAGATGAAACCAAATGTCCCCCCTGTTCAAATGTACTCTGC 

AATCCTTCTGAACCACCTCCACCCAGAAGACTAAATATGACCGCTGAGCAGTTTACAGGA 

GATCATACTCAGCACTTTTTGGATGGAGGTGAGATGAAGGTAGAACAGCTGTTTCAAGAA 

TTTGGCAACAGAAAATCCAATACTATTCAGTCAGATGGCATCAGTGACTCTGAA 

TCTCCTACTGTTTCTCAGGGTAAAAGTTCAGATTGCTTGAATACAGTT^AAATCCAACAGT 

TCATCCAAGGCACCCAAAGTGGTGCCTCTGACTCCAGAAC7VAGCCCTGAAGCAATATAAA 

CACCACCTCACTGCCTATGAGAAACTGGAAATAATTAATTATCCAGAAATTTACTTTGTA 

GGTCCAAATGCCAAGAAAAGACATGGAGTTATTGGTGGTCCCAATAATGGAGGGTATGAT 

GATGCAGATGGGGCCTATATTCATGTACCTCGAGACCATCTAGCTTATCGATATGAGGTG 

CTGAAAATTATTGGCTVAGGGGAGTTTTGGGCAGGTGGCCAGGGTCTATGATCACA^ 

CGAGAGTACGTGGCCCTAAAAATGGTGCGCAATGAGAAGCGCTTTCATCGTCAAGCAGCT 

GAGGAGATCCGGATTTTGGAGCATCTTAAGAAACAGGATAAAACTGGTAGTATGAACGTT 

ATCCACATGCTGGAAAGTTTCACATTCCGGAACCATGTTTGCATGGCCTTTGAATTGCTG 

AGCATAGACCTTTATGAGCTGATTAAAAAAAATAAGTTTCAGGGTTTTAGCGTCCT^GTTG 

GTACGCAAGTTTGCCCAGTCCATCTTGCAATCTTTGGATGCCCTCCACAAAAATAAGATT 

ATTCACTGCGATCTGAAGCCAGAAAACATTCTCCTGAAACACCACGGGCGCAGTTC^ 

AAGGTCTITTGACTTTGGGTCCAGCTGTTTCGAGTACCAGAAGCTCTACACATATATCCAG 

TCTCGGTTCTACAGAGCTCCAGAAATCATCTTAGGAAGCCGCTACAGCACACCAATTGAC 

ATATGGAGTTTTCGCTGCATCCTTGCAGAACTTTTAACAGGACAGCCTCTCTTCCCTGGA 

GAGGATGAAGGAGACCAGTTGGCCTGCATGATGGAGCTTCTAGGGATGCCACCACCAAAA 

CTTCTGGAGCT^TCCAAACGTGCCTU^GTACTTTATTAATTCCAAGGGCATACCCCGCTAC 

TGCTCTGTGACTACCCAGGCAGATGGGAGGGTTGTGCTTGTGGGGGGTCGCTCACGTAGG 

GGTAAAAAGCGGGGTCCCCCAGGCAGCAAAGACTGGGGGACAGCACTGAAAGGGTGTG^ 

GACTACTTGTTTATAGAGTTCTTGAAAAGGTGTCTTCACTGGGACCCCTCTGCCCGCTTG 

ACCCCAGCTCAAGCATTAAGACACCCTTGGATTAGCAAGTCTGTCCCCAGACCTCTCACC 

ACCATAGACAAGGTGTCAGGGAAACGGGTAGTTAATCCTGCAAGTGCTTTCCAGGGATTG 

GGTTCTAAGCTGCCTCCAGTTGTTGGAATAGCCAATAAGCTTAAAGCTAACTTAATGTCA 

GAAACCAATGGTAGTATACCCCTATGCAGTGTATTGCCAAAACTGATTAGCTAGTGGACA 

GAGATATGCCCAGAGATGCATATGTGTATATTTTTATGATCTTACAAACCTGCAAATGGA 

AAAAATGCAAGCCCATTGGTGGATGTTTTTGTTAGAGTAGACTTTTTTTAAACAAGACAA 

AACATTTTTATATGATTATAAAAGAATTCTTCAAGGGCTAATTACCTAACCAGCTTGTAT 

TGGCCATCTGGAATATGCATTAAATGACTTTTTATAGGTCA 
SEQ ID NO: 70_AA589241_M, DYRK3_M

CCACGCGTCCGGAGTTGCTAGGAATGCCACCGCAGAAACTTCTGGAGCAATCCAAGCGTG 
CCAAGTACTTTATTAACTCCAAAGGCTTGCCTCGATACTGCTCCGTATCTACCCAGACGG 
ACGGGAGGGTGGTGCTTCTCGGGGGTCGCTCACGCAGGGGTAAAAAGCGAGGCCCGCCAG 
GCAGCAAAGACTGGGCAACCGCACTGAAGGGCTGTGGTGACTACTTGTTCATAGAGTTTC 
TGAAACGATGCCTCCAGTGGGACCCCTCTGCCCGCCTCACCCCGGCTCAAGCATTAAGAC 
ATCCTTGGATTAGCAAGTCTACACCCAT^CCTCTCACCATGGACAAGGTGCCAGGGAAGC 
GGGTAGTTAACCCTACy^TGCTTTCCAGGGACTGGGTTCCAAGCTGCCTCCAGTCGTTG 
GGATAGCCAGTAAGCTTAAAGCTAACCTAATGTCCGAAACCAGTGGTAGTATACCTCTGT 
GCAGTGTATTGCCAAAGCTGATTAGCTAGTGGACCACTCAGAGACTGATACATATCATAT 
GTATTTTTAATTACCTTGCAAACATGCAAATGGAAAACGGAATAATTGAAGCCCATTCAC 
TGATGGATATGTTTTTGTTAGACTTTTTTTTAACAAGGCAGAACATTTTTATATGACTAT 
AAAAGAACGCTTCAAGGGCTAATGTCAAACCAGCTTGTATTGGCCATCTGGAGTATACAT 
TAAATGACTTTTTCATAGGTC 

SEQ ID NO: 71_5R72_16_2_H 

GTCGAGGCGCAGCGCTGCCATGGCTGGGGGCCGTGGGGCCCCCGGGCGCGGCCGGGACGA 

GCCTCCGGAGAGCTACCCGCAACGACAGGACCACGAGCTACAGGCCCTGGAGGCCATCTA 

CGGCGCGGACTTCCAAGACCTGCGGCCGGACGCTTGCGGACCGGTCAAAGAGCCCCCTGA 

AATCAATTTAGTTTTGTACCCTCAAGGCCTAACTGGTGAAGAAGTATATGTAAAAGTGGA 

TTTGAGGGTTAAATGCCCACCTACCTATCCAGATGTAGTTCCTGAAATAGAGTTAAAAAA 

TGCCAAAGGTCTATCAAATGAAAGTGTCAATTTGTTAAAATCTCGCCTAGAAGAACTGGC 

CAAGAAACACTGTGGGGAGGTGATGATCTTTGAACTGGCTTACCACGTGCAGTCATTTCT 

CAGCGAGCATAACAAGCCCCCTCCCAAGTCTTTTCATGAAGAAATGCTGGAAAGGCGGGC 

TCAGGAGGAGCAGCAGAGGCTGTTGGAGGCCAAGCGGAAAGAAGAGCAGGAGCAACGTGA 

AATCCTGOVTGAGATTCAGAGAAGGAAAGAAGAGATAAAAGAAGAGAAAAAAAGGAAAGA 

AATGGCTAAGCAGGAACGTTTGGAAATTGCTAGTTTGTCAAACCAAGATCATACCTCTAA 

GAAGGACCCAGGAGGACACAGAACGGCTGCCATTCTACATGGAGGCTCTCCTGACTTTGT

AGGAAATGGTAAACATCGGGCAAACTCCTCAGGAAGGTCTAGGCGAGAACGTCAGTATTC 

TGTATGTAATAGTGAAGATTCTCCTGGCTCTTGTGAAATTCTGTATTTCAATATGGGGAG 

TCCTGATCAGCTCATGGTGCACAAAGGGAAATGTATTGGCAGTGATGAACAACTTGGAAA 

ATTAGTCTACAATGCTTTGGAAACAGCCACTGGTGGCTTTGTCTTGTTGTATGAGTGGGT 

CCTTCAGTGGCAGAAAAAAATGGGTCCATTCCTTACCAGTCAAGAAAAAGAGAAGATTGA 

TAAGTGCAAAAAGCAGATTCAAGGAACAGAAACAGAATTCAACTCACTGGTAAAATTGAG 

CCATCCAAATGTAGTACGCTACCTTGCAATGAATCTCAAAGAGCAAGACGACTCCATCGT 

GGTGGACATTTTAGTGGAGCACATTAGTGGGGTCTCTCTTGCTGCACACCTGAGCCACTC 

AGGCCCCATCCCTGTGCATCAGCTTCGCAGGTACACAGCTCAGCTCCTGTCAGGCCTTGA 

TTATCTGCACAGCAATTCTGTGGTGCATAAGGTCCTGAGTGCATCTAATGTCTTGGTGGA 

TGCAGAAGGCACCGTCAAGATTACGGACTATAGCATTTCTAAGCGCCTCGCAGACATTTG 

CAAGGAGGATGTGTTTGAGCAAACCCGAGTTCGTTTTAGTGACAATGCTCTGCCTTATAA 

AACGGGGAAGAAAGGAGATGTTTGGCGTCTTGGCCTTCTGCTGCTGTCCCTCAGCCAAGG 

ACAGGAATGTGGAGAGTACCCTGTGACCATCCCTAGTGACTTACCAGCTGACTTTCAAGA 

TTTTCTAT^GAAATGTGTGTGCTTGGATGACAAGGAAAGATGGAGTCCCCAGCAGTTGTT 

GAAACACAGCTTTATAAATCCCCAGCCAAAAATGCCTCTAGTGGAACAAAGTCCTGAAGA 

TTCTGGAGGACAAGATTATGTTGAGACTGTTATTCCTAGCAACCGGCTACCCAGTGCTGC 

CTTCTTTAGTGAGACACAGAGACAGTTTTCCCGATACTTCATTGAGTTTGAAGAATTACA 

ACTTCTTGGTAAAGGAGCTTTTGGAGCTGTCATCAAGGTGCAGAACAAGTTGGACGGCTG 

CTGCTACGCAGTGAAGCGCATCCCCATCAACCCGGCCAGCCGGCAGTTCCGCAGGATCAA 

GGGCGAAGTGACACTGCTGTCACGGCTGCACCATGAGAACATTGTGCGCTACTACAACGC 

CTGGATCGAGCGGCACGAGCGGCCGGCGGGACCGGGGACGCCGCCCCCGGACTCCGGGCC 

CCTGGCCAAGGATGACCGAGCTGCACGCGGGCAGCCGGCGAGCGACACAGACGGCCTGGA

CAGCGTAGAGGCCGCCGCGCCGCCACCCATCCTCAGCAGCTCGGTGGAGTGGAGCACTTC 

GGGCGAGCGCTCGGCCAGTGCCCGTTTCCCCGCCACCGGCCCGGGCTCCAGCGATGACGA 

GGACGACGACGAGGACGAGCACGGTGGCGTCTTCTCCCAGTCCTTCCTGCCTGCTTCAGA 

TTCTGAAAGTGATATTATCTTTGACAATGAAGATGAGAACAGTAAAAGTCAGAATCAGGA 

TGAAGATTGCAATGAAAAGAATGGCTGCCATGAAAGTGAGCCATCAGTGACGACTGAGGC 

TGTGCACTACCTATACATCCAGATGGAGTACTGTGAGAAGAGCACTTTACGAGACACCAT 

TGACCAGGGACTGTATCGAGACACCGTCAGACTCTGGAGGCTTTTTCGAGAGATTCTGGA 

TGGATTAGCTTATATCCATGAGAAAGGAATGATTCACCGGGATTTGAAGCCTGTCAACAT 

TTTTTTGGATTCTGATGACCATGTGAAAATAGGTGATTTTGGTTTGGCGACAGACCATCT 

AGCCTTTTCTGCTGACAGCAAACAAGACGATCAGACT^GGAGACTTGATTAAGTCAGACCC 

TTCTVGGTCACTTAACTGGGATGGTTGGCACTGCrCTCTATGTAAGCCCAGAGGTCCAAGG 

AAGCACCAAATCTGCATACAACCAGAAAGTGGATCTCTTCAGCCTGGGAATTATCTTCTT 

TGAGATGTCCTATCACCCCATGGTCACGGCTTCAGAAAGGATCTTTGTTCTCAACCAACT 

CAGAGATCCCACTTCGCCTAAGTTTCCAGAAGACTTTGACGATGGAGAGCATGCAAAGC^ 

GAAATCAGTCATCTCCTGGCTGTTGAACCACGATCCAGCAAAACGGCCCAC^^ 

GCTGCTCAAGAGTGAGCTGCTGCCCCCACCCCAGATGGAGGAGTCAGAGCTGCATGAAGT 

GCTGCACCACACGCTGACCAACGTGGATGGGAAGGCCTACCGCACCATGATGGCCCAGAT 

CTTCTCGCAGCGCATCTCCCCTGCCATCGATTACACCTATGACAGCGACATACTGAAGGG 

CAACTTCTCAATCCGTACAGCCAAGATGCAGCAGCATGTGTGTGAAACCATCATCCGC^ 

CTTTAAAAGACATGGAGCTGTTCAGTTGTGTACTCCACTACTGCTTCCCCGAAACAGACA 

AATATATGAGCACAACGAAGCTGCCCTATTCATGGACCACAGCGGGATGCTGGTGATGCT 

TCCTTTTGACCTGCGGATCCCTTTTGCAAGATATGTGGCAAGAAATAATATATTGAATTT 

AAAACGATACTGCATAGAACGTGTGTTCAGGCCGCGCAAGTTAGATCGATTTCATCCCAA 

AGAACTTCTGGAGTGTGCATTTGATATTGTCACTTCTACCACCAACAGCTTTCTGCCCAC 

TGCTGAAATTATCTACACTATCTATGAAATCATCCAAGAGTTTCCAGCACTTCAGGAAAG 

AAATTACAGTATTTATTTGAACCATACCATGTTATTGAAAGCAATACTCTTACACTGTGG 

GATCCCAGAAGATAAACTCAGTCAAGTCTACATTATTCTGTATGATGCTGTGACAGAGAA 

GCTGACGAGGAGAGAAGTGGAAGCTAAATTTTGTAATCTGTCTTTGTCTTCTAATAGTCT 

GTGTCGACTCTACAAGTTTATTGAACAGAAGGGAGATTTGCAAGATCTTATGCCAACAAT 

AAATTCATTAATAAAACAGAAAACAGGTATTGCT^CT^GTTGGTGAAGTATGGCTTAAAA 

CCTAGAGGAGGTTGTTGGACTGTTGAAGAAACTCGGCATCAAGTTACAGGTCTTGATCAA 

TTTGGGCTTGGTTTACAAGGTGCAGCAGCACT^TGGAATCATCTTCCAGTTTGTG 

CATCAAACGAAGGCT^GGGCTGTACCTGAAATCCTCGCAGCTGGAGGCAGATATGACC 

GCTGATTCCCCAGTTTAGAGGGCCACAAGCTCTGGGGCCAGTTCCCACTGCCATTGGGGT 

CAGCATAGCTATAGACAAGATATCTGCTGCTGTCCTCAACATGGAGGAATCTGTTACAAT 

AAGCTCTTGTGACCTCCTGGTTGTAAGTGTTGGTCAGATGTCTATGTCCAGGGCCATCAA 

CCTAACCCAGAAACTCTGGACAGCAGGCATCACAGCAGAAATCT^TGTACGACTGGTCACA 

GTCCCAAGAGGAATTACAAGAGTACTGCAGACATCATGAAATCACCTATGTGGCCCTTGT 

CTCGGATAAAGAAGGAAGCCATGTCAAGGTTAAGTCTTTCGAGAAGGAAAGGCAGACAGA 

GAAGCGTGTGCTGGAGACTGAACTTGTGGACCATGTACTGCAGAAACTGAGGACTAAAGT 

CACTGATGAAAGGAATGGC7VGAGAAGCTTCCGATAATCTTGCAGTGCAAAATCTGAA 

GTCATTTTCTAATGCTTCAGGTTTGTTTGAAATCCATGGAGCAACAGTGGTTCCCATT^ 

GAGTGTGCTAGCCCCGGAGAAGCTGTCAGCCAGCACTAGGAGGCGCTATGAAACTCAGGT 

ACAAACTCGACTTCAGACCTCCCTTGCCAACTTACATCAGAAAAGCAGTGAAATTGAAAT 

TCTGGCTGTGGATCTACCCT^GAAACAATATTACAGTTTTTATCATTAGAGTGGC^ 

TGATGAACAGGCATTTAACACAACTGTGAAGCAGCTGCTGTCACGCCTGCCAAAGCAAA 

ATACCTCAAATTAGTCTGTGATGAAATTTATAACATCAAAGTAGAAAAAAAGGTGTCTGT 

GCTATTTCTGTACAGCTATAGAGATGACTACTACAGAATCTTATTTTAACCCTAAAGAAC 

TGTCGTTAACCTCATTCAAACAGACAGAGGCTTATACTGGAATAATGGAATGTTGTACAT 
TCATCATAATTTAAAATTAAATTCTAAGAAGAGGCTGGGTGCAGTGGCrCACACCTTT^

TCCCAGCACTTTGGGAAGCCAAGGCAGGAAGACTGCTTGAAACCAGGAGTTTGAGACC^ 

CCT 

SEQ ID NO: 73_R43524_H, HRI_H 

ATGCTGGGGGGCAACTCCGGGGTCCGCAAGCGCGAAGAGGAGGGCGACGGGGCTGGGGCT 

GTGGGTGCGCCGCCGGCCATCGACTTTCCCGCCGAGGGCCCGGACCCCGAATATGACGAA 

TCTGATGTTCCAGCAGAAATCCAGGTGTTAAAAGAACCCCTACAACAGCCAACCTTCCCT 

TTTGCAGTTGCA/^CCAACTOTTGCTGGTTTCTTTGCTGGAGCACTTGAGCCACGTGCAT 

GAACCAAACCCACTTCGTTCAAGACAGGTGTTTAAGCTACTTTGCCAGACGTTTATCAAA 

ATGGGGCTGTTGTCTTCTTTCACTTGTAGTGACGAGTTTAGCTCATTGAGACTACATCAC 

AACAGAGCTATTACTVCACTTAATGAGGTCTGCTAAAGAGAGAGTTCGTCTVGGATCCTT^ 

GAGGATATTTCTCGTATCCAGAAAATCAGATC7VAGGGAAGTAGCCTTGGAAGCACAAACT 

TCACGTTACTTAAATGAATTTGAAGAACTTGTCATCTTAGGAAAAGGTGGATACGGAAGA 

GTATACAAGGTCAGGAATAAATTAGATGGTCAGTATTATGCAATAAAAAAAATCCTGATT 

AAGGGTGCAACTAAAACAGTTTGCATGAAGGTCCTACGGGAAGTGAAGGTGCTGGCAGGT 

CTTCAGCACCCCAATATTGTTGGCTATCACACCGCGTGGATAGAACATGTTCATGTGATT 

CAGCCACGAGCAGACAGAGCTGCCATTGAGTTGCCATCTCTGGAAGTGCTCTCCGACCAG 

GAAGAGGACAGAGAGCAATGTGGTGTTAAAAATGATGAAAGTAGCAGCTCATCCATTATC 

TTTGCTGAGCCCACCCCAGAAAAAGAAAAACGCTTTGGAGAATCTGACACTGAAAATCAG 

AATAACAAGTCGGTGAAGTACACCACCAATTTAGTCATAAGAGAATCTGGTGAACTTGAG 

TCGACCCTGGAGCTCCAGGAAAATGGCTTGGCTGGTTTGTCTGCCAGTTCAATTGTGGAA 

CAGCAGCTGCCACTCAGGCGTAATTCCCACCTAGAGGAGAGTTTCACATCCACCGAAGAA 

TCTTCCGAAGAAAATGTCAACTTTTTGGGTCAGACAGAGGCACAGTACCACCTGATGCTG 

CACATCCAGATGCAGCTGTGTGAGCTCTCGCTGTGGGATTGGATAGTCGAGAGAAACAAG 

CGGGGCCGGGAGTATGTGGACGAGTCTGCCTGTCCTTATGTTATGGCCAATGTTGCAACA 

AAAATTTTTCAAGAATTGGTAGAAGGTGTGTTTTACATACATAACATGGGAATTGTGC^ 

CGAGATCTGAAGCCAAGAAATATTTTTCTTCATGGCCCTGATCAGCAAGTAAAAATAGGA 

GACTTTGGTCTGGCCTGC^CAGACATCCTACAGAAGAACACAGACTGGACCAACAGAAAC 

GGGAAGAGAACACCAACACATACGTCCAGAGTGGGTACTTGTCTGTACGCTTCACCCGAA 

CAGTTGGAAGGATCTGAGTATGATGCCAAGTCAGATATGTACAGCTTGGGTGTGGTCCTG 

CTAGAGCTCTTTCAGCCGTTTGGAACAGAAATGGAGCGAGCAGAAGTTCTAAGAGGTTTA 

AGAACTGGTCAGTTGCCGGAATCCCTCCGTAAAAGGTGTCCAGTGCAAGCCAAGTATATC 

CAGCACTTAACGAGAAGGAACTCATCGCAGAGACCATCTGCCATTCAGCTGCTGCAGAGT 

GAACTTTTCCAAAATTCTGGAAATGTTAACCTCACCCTACAGATGAAGATAATAGAGCAA 

GAAAAAGAAATTGCAGAACTAAAGAAGCAGCTAAACCTCCTTTCTCAAGACAAAGGGG 

AGGGATGACGGAAAGGATGGGGGCGTGGGATGA 

SEQ ID NO: 74_17000057519457_H 

CACAAGAGCCCTTCCTGCAGGGAACCTCAGGCTTCAGAGAGCCGAAAAGTTGGGAGGCGT 
AACCACTTACAGGCCGGAAGTGTCCGGGGTGGACGCATTCGGGTAGCCGAAGAAGTCCCA 
GGATTGCCGAAGAAGTCCCAGGATTTCCGAAGCGAGCCGAAGCATCGCGACAGTTTTCAG 
AGACAGCTGATCGGTTGGAGCTGTTGCGCCGAGCAGTCATGGCGGCGGCCAGAGCTACTA 
CGCCGGCCGATGGCGAGGAGCCCGCCCCGGAGGCTGAGGCTCTGGCCGCAGCCCGGGAGC 
GGAGCAGCCGCTTCTTGAGCGGCCTGGAGCTGGTGAAGCAGGGTGCCGAGGCGCGCGTGT 
TCCGTGGCCGCTTCCAGGGCCGCGCGGCGGTGATCAAGCACCGCTTCCCCAAGGGCTACC 
GGCACCCGGCGCTGGAGGCGCGGCTTGGCAGACGGCGGACGGTGCAGGAGGCCCGGGCGC 
TCCTCCGCTGTCGCCGCGCTGGAATATCTGCCCCAGTTGTCTTTTTTGTGGACTATGCTT 
CCAACTGCTTATATATGGAAGAAATTGAAGGCTCAGTGACTGTTCGAGATTATATTCAGT 
CCACTATGGAGACTGAAAAAACTCCCCAGGGTCTCTCCAACTTAGCCAAGACAATTGGGC 
AGGTTTTGGCTCGAATGCACGATGAAGACCTCATTCATGGTGATCTCACCACCTCCAACA

TGCTCCTGAAACCCCCCCTGGAACAGCTGAACATTGTGCTCATAGACTTTGGGCTGAGTT 

TCATTTCAGCACTTCCAGAGGATAAGGGAGTAGACCTCTATGTCCTGGAGAAGGCCTTCC 

TCAGTACCCATCCCT^CACTGAAACTGTGTTTGAAGCCTTTCTGAAGAGCTACTCCACCT 

CCTCCAAAAAGGCCAGGCCAGTGCTAAAAAAATTAGATGAAGTGCGCCTGAGAGGAAGAA 

AGAGGTCCATGGTTGGGTAGAAGAATGTGTATGACAACCACACACAGTGAAGCTCTTTTT 

TCAAAGTAAATTTGAAGAAATGCTACAAGTATGAGATGAGATCTAAGTAAAGGTGTTAAG 

ATATTTTTAAGTGGTATGTGATCGTGTCATTATCATCTGCACTTCACTCAAGAGCTTACT 

ATGTGTCTAAGTCATGTTCTAGGCAGAATTGGGTATTTAAAGTAAATTGAGGACAGGCTT 

CTCCCAGATTGTGACATGTATATCTCAGATACATGGGTGTGGCATTGAACCACATT^TGA 

GAACATTATTCTCTTTTTAGTCCTTGTGAGACAAGGATGAAGTCTCAGTTGCTGATACTC 

GCTGAGCTTACTGGCCCTCTAACCCAGTGTTTTTTTTTGTTGTTGTTGTGTACATGTTAT 

ATTTATTTTGAAACCAGTTTAATGGGATACAACCAGCATTTTAAAAAATGAAATA 

CAGCATGGAAAATATCAGTGTATTGTTTTATGAAACTTTCACGTGTATATATAGACCAAG 

GATATGTGCTGAGTTTTGATGTCAAATATATTTCTCTTTCAGGGTCATGATC7VAAAAATG 

AAAAGTCTGCTTAACTCCAATTTCTCTTTTAAAAAAGCAGACTTACAGCTTTCAGGCAAC 

TGAT^TTCATGTTAACATGTTTTTATTTTTATTGCTTTGTATTTTTGTGGTTACCTTCTA 

AGACAAGTGATTGATCTAAAGTTCCTTTTAAGTTTATACCGCTAAACAAACTGAGTTGAT 

TTCTATCACAGGCAGTAAGTAGGTAGAGCAAAAATGGTGAAGTGACTTGTGAAGACTGAA 

GTTTGATGAAGTCTGGTTTAAGGC^CAGGTAAACTGAGTGTGGATGCTVAAAGTACCA^ 

GCTAGCTTTTAACCTTGCCCAGCCTCAGTTTCTTTTCTTAGAAGAAGCTATGTTTGGGTG 

GGAAGGGAAGAGAGGGATAAGAAAATACCTTTCTTCCTTGTAAACTCCAATCAACAAACA 

TATTTTGAGTGCCTTTTGTGTTCCTTGGCACCCTGTTGGGTATTGGGTACTTGGCACCCT 

GTTGGGTATTGGGTACAATGGTGAGCCAGACAGACACTIGCGCCTGTCCTTTTGTAAGAAT 

ATTTATTTTTATAAAAAAGTATAAAGTATACAGTGGGATGTTTTGATATACATTATGAAA 

TGATTGCTACAGCTGAGCTAATTAACACCCATCACCTCACATAGTTACTGTCTTGTTTCT 

TAATATGGACTITTTGCAGCTATGAATTTCCCTCTGCACACTGTTGTCATCACACACTCT 

AGTTTTGGTATTTTGTGTTTTTGTTTTCATTCATCTCAAAGTATTTTCTAATTTCCCTTG 

TGATTTCTTCTTTGACCCCTTGATTGTTTAGAAATCTGTTAATTTCCACACATTTGTAAA 

TGTTCCAATTTTTCTTTTGTTATTGCCAGCTTCATTCCATTGTGTTCAGAGATGATACAG 

TCAGTGCCTGTTCTTATGAAGCAAACATTCTATAATAGTAGGACCAGTACCCTGTCTGTT 

TCATTCACCACAGTCT^GCATGCCCCAAGTGCCCmGCATGGGGCGGATGGCC^ 

TGAAAACTTCCCTTCCTGGGTAGTTGTGACTAGTAGAGAGGAAAAATAATATAATTGCCT 

GCTTACTGCATGCCAGGCATTGGGCTGGGAATTTTTATATTGGATCTAAAATAACTCTTA 

AGTTAGGCATTATCCCCATTTTATAGATGGAGAAACTGGCCCCAAAAGGTGGGAACTTGT 

CCAAGACGTCACAGGTAGCAAGAGGTACTTTTACCTGGCTCCAAATCTGTGTTCTTTCCA 

CTGACAAATGAGATATGGGATATGGTGCATCTTTACAGTACTATAATAAGTATTGGCGTA 

TAACATTATTTTCAAGGAACTCCAAGGGCCACAGGAGCTGACAGGTTTTTCAATTAATAT 

TCCCAACATGAATGAGATGCCTCATTCCTCAGTTTCCTCACGTGTACTATAAGGCTAGTA 

CCTGCTTTGTTGGGGTATGGTTGGCTCGTGTGCATTAAGTCAACAAATCCCTAGT 

SEQ ID NO: 75_AA013524_M 

CTGGTGCAGCAGGGCGCCGAGGCGCGCGTTTTCCGTGGCCGCTTCCAGGGCCGCGCGGCC 
GTGGTGAAGCACCGCTTCCCGAAGAGTTACCGGCACCCGGAGCTGGAGGCGCGGCTCGGC 
CGTCGGCGGACGGTGCAGGAGGCGCGCGCGCTGCTCCGCTGCCGCCGTGCGGGGATAGCT 
GCCCCAGTCGTCTTCTTTGTGGACTATGCGTCTAACTGCTTATATATGGAAGAAATCGAA 
GACTCGGTGACTGTTCGGGATTATATCCAATCCACTATGGAGACTGAAAAGGACCCCCAG 
TGCCTCTTGGACCTGGCCAGGAGGATGGGGCAGGTTCTGGCCGGAATGCACGACCAAGAC 
CTCATTCACGGGGACCTCACCACCTCCAACATGCTCCTGAGGCGGCCCCTGGCGCAGCTG 
CACATCGTGCTCATCGACTTTGGGCTGAGCTTTGTCTCAGGACTGCCGGAAGATAAAGGC 
GTCGACCTCTATGTCCTGGAGAAGGCCTTCCTCAGCACGCACCCCCACACCGAGACCGCG
TTTGAAGCCTTTCTGAAGAGTTACGGGGCCTCGTCCAAGAAGTCCAGTCCAGTGCTGAAG 
AAGTTAGATGAGGTGCGCCTGAGAGGGCGAAAGCGGTCCATGGTCGGGTAGTGGAGCTGT 
GGTGAACTGGCTCACGGTGAAGGATGATGTAGACGAGGCTGGACCCCTCAGCAAAGCATG 
GGTTGTTAAGTGGTCTGTGATCGTGCTGGGCCACCACCATCCATGGCTCACTGTTCTCAG 
GGGCTTCATGTACATGAGGTTTATTCTGGGCAGAACTGGGTAGGTAGCCCAGGCTAGCCT 
TGAATTTATGGCAACATCCTACCTCAGCTTGCTTGGAAGAGGTTATAAGCCACCATACCT 
GACTTTGCACTGATTCTGTCAGAAAC 

SEQ ID NO: 76_17006139801197_H, IRAKM_H

ATGGCGGGGAACTGTGGGGCCCGCGGCGCGCTGTCGGCGCACACGCTGCTGTTCGACCTG 

CCGCCCGCGCTGCTCGGAGAGCTCTGCGCTGTTCTGGACAGCTGCGACGGCGCGCTGGGC 

TGGCGCGGCCTGGCAGAGAGACTTTCAAGCAGCTGGCTGGATGTTCGTCATATTGAAAAG 

TATGTAGACCT^GGTAAAAGTGGAACAAGAGAATTACTTTGGTCCTGGGCACAGAAAAAC 

AAGACCATCGGTGACCTTTTACAGGTCCTCCAGGAGATGGGACATCGTCGAGCTATTCAT 

TTAATTACAAACTATGGAGCAGTGTTGAGTCCTTCAGAGAAGAGTTATCAGGAAGGTGGA 

TTTCCAAATATATTATTCAAGGAAACAGCCAATGTCACCGTGGATAATGTTCTTATTCCT 

GAACATAATGAAAAAGGAGTACTGCTTAAATCTTCCATCAGCTTTCAAAATATCATAGAA 

GGAACTAGAAATTTCCACAAAGACTTCCTAATTGGAGAAGGAGAGATTTTTGAGGTATAC 

AGAGTGGAGATTCAAAACCTAACATATGCTGTCAT^TTATTTAAACAGGAGAAAAAAATG 

CAGTGTAAGAAGCATTGGAAGAGGTTTTTATCTGAGCTTGAAGTTTTACTACTGTTTCAT 

CACCCAAACATACTAGAGTTGGCTGCATATTTTACAGAGACTGAGAAGTTCTGTCTGATO 

TATCCATACATGAGAAATGGAACACTTTTTGACAGATTGCAGTGTGTAGGTGACACGGCC 

CCACTCCCTTGGCACATTCGAATCGGTATATTAATAGGAATATCCAAAGCCATTCACTAC 

CTGCACAACGTTCAACCATGCTCGGTCATCTGTGGCAGTATATCAAGTGCAAAC^ 

TTGGATGATCAGTTTCAACCCAAACTAACTGATTTTGCCATGGCACACTTCCGGTCCCAC 

CTAGAACATCAGAGTTGTACCATAAATATGACCAGCAGCAGCAGTAAACATCTGTGGTAC 

ATGCCAGAAGAGTAC7VTCAGACAGGGGAAACTTTCCATTAAAACAGATGTCTACAGCTTT 

GGAATTGTAATAATGGAAGTTCTAACAGGATGTAGAGTAGTGTTAGATGATCCAAAACAT 

ATCCAGCTGCGGGATCTCCTTAGAGAATTGATGGAGAAGAGAGGCCTGGATTCATGTCTC 

TCATTTCTAGATAAGAAAGTGCCTCCCTGCCCTCGGAATTTCTCTGCCAAGCTCTTCTGT 

TTGGCAGGCCGGTGTGCTGCAACGCGGGCAAAGTTAAGACCATCAATGGATGAAGTTTTA 

AATACTCTTGAAAGTACTCAAGCCAGCrTGTATTTTGCTGAAGATCCTCCCACATCACTA 

AAGTCCTTCAGGTGTCCTTCTCCTCTATTCCTGGAGAATGTACCAAGTATTCCAGTGGAA 

GATGATGAAAGCCAGAATAACy^TTTACTACCTTCTGATGAAGGCCrGAGGATAGACAGA 

ATGACTCAGAAAACTCCTTTTGAATGCAGCCAGTCTGAGGTTATGTTTCTGAGCTTGGAC 

AAAAAGCCAGAGAGCAAGAGAAATGAGGAAGCTTGCAACATGCCCAGTTCTTCTTGTGAA 

GAAAGTTGGTTCCCAAAGTATATAGTTCCATCCCAGGACTTAAGGCCCTATAAGGTAAAT 

ATAGATCCTTCTTCT^GAAGCTCCAGGGCATTCTTGCAGGAGCAGGCCAGTGGAGAGCT^G 

TGTTCCTCCAAATTTTCCTGGGATGAATATGAACAGTACAAAAAAGAATAA 



SEQ ID NO: 77_AA840598_M, IRAKM_M

ATGTGGAAGAGATTTTTATCAGAACTGGAAGTTCTACTCCTGTTCCGTCACCCCCACATA 
CTAGAGCTGGCTGCATATTTCACGGAGACTGAGAAACTTTGTCTGGTTTATCCCTATATG 
AGCAACGGGACGCTTTTCGACAGATTACAGTGCACAAATGGCACAACCCCGCTTTCCTGG 
CACGTTCGAATCAGCGTATTGATAGGAATAGCCAAAGCCATCCAATACTTGCACAACACT 
CAGCCGTGCGCCGTCATCTGTGGCAACGTTTCCAGTGCAAACATACTCTTGGATGACC^^ 
CTCCAACCCAAACTAACGGATTTTGCTGCAGCGCACTTCCGACCCAATCTAGAGCAGCAG 
AGTTCTACCATAAATATGACCGGCGGTGGCAGGAAACATCTGTGGTACATGCCAGAAGAA 
TACATCAGACAGGGAAGACTTTCCGTTAAAACTGATGTCTACAGCTTCGGAATCGTGATC

ATGGAGGTTCTAACGGGCTGCAAAGTGGTGCTGGATGACCCGAAACACGTTCAGCTGCGG 

GACCTCCTCATGGAACTGATGGAGAAAAGAGGCCTAGACTCCTGCCTGTCCTTCTTAGAC 

AGGAAGATACCT^CCCTGTCCTCGGAACTTCTCTGC/U^GCTCTTCTCTCTGGCGGGCCGG 

TGTGTGGCAACGAAGGCCAAGTTAAGACCCACGATGGACGAAGTCCTGTCCTCTCTGGAG 

AGCACCCAGCCTAGCTTGTATTTTGCAGAAGACCCTCCCACGTCCTTGAAGTCCTTCAGG 

TGTCCTTCTCCACTGTTCTTGGATAATGTCCCAAGTATTCCAGTAGAAGATGATGAAAAC 

CAGAATAACCATTCAGTACCTCCCAAGGAAGTTTTGGGGACAGATAGAGTGACTCAGAAA 

ACCCCCTTTGAATGCAGCCAGTCTGAGGTCACCTTTCTAGGCTTGGACCGAAACAGAGGG 

AACAGGGGAAGTGAAGCGGATTGCAACGTGCCCAGTTCTTCTCATGAGGAATGCTGGTCC 

CCAGAGCTTGTGGCGCCATCCCAGGACTTAAGTCCTACTGTGATCAGTTTGGGCTCGTCT 

TGGGAAGTACCAGGCCATTCTTATGGGAGCAAGCCAATGGAGAAGAGGTGTTCCTCTGGG 

CTCTTTTGCAGTGAGCATGAACAGTCCAAAAAGCAGTGAATCCACCAGAAGATCAAGCAA 

AAAATAAAAGCAAACGTCACTGAAGGCACTGAGCAAATAGCATCCCCGTGAAAAGACACG 

AGCTCTGAGCTCCGTGAGTACAGCCAAGGGACCAACTGATGGAGAATTTGAATGGTGCAG 

ATTAGCAGCAAGGAAGTCTATTCCTTCCTCCAAACAGAATAATTTCAAGAGATGCTTTAT 

TCAAGTGACCGCCTCTCAGTCAAACCTGAGAAGCTAAACTGGAGCCAATCAGAATTATCC 

AAGATTCCGGGTTCTGACT^CCAAAACCTAGCAAAGAGTAGCAGGACAAGTCTCTCTCTT 

AAGTCTCTCACTCTCTCTCATCATCCGAGTGAGATCTTGGTATAGGTGAACAGAGAACCA 

CCCACCTTCCAGAACCAGAACCACCTTCTCCCCAAGCCAGCAGTCAGTCACTCACCATCA 

GO^GCCAGTAGTCACCAGCAGCCAATCATGATACAGTGTa^CTCTCCCTCTGCGCATGC 

CTACGTCCTTTATAAAACCCAGGTCTTCAGGGCCCACCCCTTTCTTTTTCCATCCTTGCT 

CAGAGGCAGCCTTTTGTATACATTCCCTGACCCCACCCCT^TTATATCTCTCATATGATA 

TCTGTTGCGTAGTGTGACTTTGTGGCATGACTTGGTTGTCAGATCATTTGCACAAGAACA 

AGCGAATACACAACAACT^GCCCACCATCATTACCACCGGCACTTAATGCT^ 

TGCTAGGGATACTGACAGTCTATTTGCTTCCCATGGTCATAGGGAAGTTGCTCAAATGCA 

AAGGTTGTAGGGAATGTCTAATTTGTAAATGGCGTCGGGTGCCTTTGGAAGGAATTGTGT 

TTTTACAGCO^GTTGCTACTCTTGTTTATCGCTGGTTAACCGGTCTGTCCGGAAGTGAGC 

O^GTCATCCTTGCTAGGGCTTTTTCTGTGTAGAGAGGGAATTCCAGTCCAAAGTCTGCT 

TCTCTGTATTTAAATTCTTAGAAGAGTTGCCTGTGGCATTCCAATTGTTATATAAAAAAA 

TTATATTAAAGAATTCCAGCACT 

SEQ ID NO: 78_AA088547_H 

ATGGCGAGTGCGGTCAGGGGGTCGAGGCCGTGGCCGCGGCTGGGGCTCCAGCTCCAGTTC 
GCGGCGCTGCTGCTCGGGACGCTGAGTCCACAGGTTCATACTCTCAGGCCAGAGAACCTC 
CTGCTGGTGTCCACCTTGGATGGAAGTCTCCACGCACTAAGCT^GCAGACAGGGGACCTG
AAGTGGACTCTGAGGGATGATCCCGTCATCGAAGGACCAATGTACGTCACAGAAATGGCC 
TTTCTCTCTGACCCAGCAGATGGCAGCCTGTACATCTTGGGGACCCAAAAACAACAGGGA 
TTAATGAAACTGCCATTCACCATCCCTGAGCTGGTTCATGCCTCTCCCTGCCGCAGCTCT 
GATGGGGTCTTCTACACAGGCCGGAAGCAGGATGCCTGGTTTGTGGTGGACCCTGAGTCA 
GGGGAGACCCAGATGACACTGACCACAGAGCX3TCCCTCCACCCCCCGCCTCTACATTGGC 
CGAACACAGTATACGGTCACCATGCATGACCCAAGAGCCCCAGCCCTGCGCTGGAACACC 
ACCTACCGCCGCTACTCAGCGCCCCCCATGGATGGGTCACCTGGGAAATACATGAGCCAC 
CTGGCGTCCTGCGGGATGGGCCTGCTGCTCACTGTGGACCCAGGAAGCGGGACGGTGCTG 
TGGACACAGGACCTGGGCGTGCCTGTGATGGGCGTCTACACCTGGCACCAGGACGGCCTG 
CGCCAGCTGCCGCATCTCACGCTGGCTCGAGACACTCTGCATTTCCTCGCCCTCCGCTGG 
GGCCACATCCGACTGCCTGCCTCAGGCCCCCGGGACACAGCCACCCTCTTCTCTACCTTG 
GACACCCAGCTGCTAATGACGCTGTATGTGGGGAAGGATGAAACTGGCTTCTATGTCTCT 
AAAGCACTGGTCCACACAGGAGTGGCCCTGGTGCCTCGTGGACTGACCCTGGCCCCCGCA 
GATGGCCCCACCACAGATGAGGTGACACTCCAAGTCTCAGGAGAGCGAGAGGGCTCACCC 
AGCACTGCTGTTAGATACCCCTCAGGCAGTGTGGCCCTCCCAAGCCAGTGGCTGCTCATT

GGACACCACGAGCTACCCCCAGTCCTGCACACCACCATGCTGAGGGTCCATCCCACCCTG 

GGGAGTGGAACTGCAGAGACAAGACCTCCAGAGAATACCCAGGCCCCAGCCTTCTTCTTG 

GAGCTATTGAGCCTGAGCCGAGAGAAACTTTGGGACTCCGAGCTGCATCCAGAAGAAAAA 

ACTCCAGACTCTTACTTGGGGCTGGGACCCCAAGACCTGCTGGCAGCTAGCCTCACTGCT 

GTCCTCCTGGGAGGGTGGATTCTCTTTGTGATGAGGCAGGTGGTGGAGAAGCAGCAGGAG 

ACCCCCCTGGCACCTGCAGACTTTGCTCACATCTCCCAGGATGCCCAGTCCCTGCACTCG 

GGGGCCAGCCGGAGGAGCCAGAAGAGGCTTCAGAGTCCCTCAAAGCAAGCCCAGCCACTC 

GACGACCCTGAAGCTGAGCAACTCACCGTAGTGGGGAAGATTTCCTTCAATCCCAAGGAC 

GTGCTGGGCCGCGGGGCAGGCGGGACTTTCGTTTTCCGGGGACAGTTTGAGGGACGGGCA 

GTGGCTGTCAAGCGGCTCCTCCGCGAGTGCTTTGGCCTGGTTCGGCGGGAAGTTCAACTG 

CTGCAGGAGTCTGACAGGCACCCCAACGTGCTCCGCTACTTCTGCACCGAGCGGGGACCC 

CAGTTCCACTACATTGCCCTGGAGCTCTGCCGGGCCTCCTTGCAGGAGTACGTAGAAAAC 

CCGGACCTGGATCGCGGGGGTCTGGAGCCCGAGGTCGTGCTGCAGCAGCTGATGTCTGGC 

CTGGCCCACCTGCACTCTTTACACATAGTGCACCGGGACCTGAAGCCAGGAAATATTCTC 

ATCACCGGGCCTGACAGCCAGGGCCTGGGCAGAGTGGTGCTCTCAGACTTCGGCCTCTGC 

AAGAAGCTGCCTGCTGGCCGCTGTAGCTTCAGCCTCCACTCCGGCATCCCCGGCACGGAA 

GGCTGGATGGCGCCCGAGCTTCTGCAGCTCCTGCCACCAGACAGTCCTACCAGCGCTGTG 

GACATCTTCTCTGCAGGCTGCGTGTTCTACTACGTGCTTTCTGGTGGCAGCCACCCCTTT 

GGAGACAGTCTTTATCGCCAGGCAAACATCCTCACAGGGGCTCCCTGTCTGGCTCACCTG 

GAGGAAGAGGTCCACGACAAGGTGGTTGCCCGGGACCTGGTTGGAGCCATGTTGAGCCCA 

CTGCCGCAGCCACGCCCCTCTGCCCCCCAGGTGCTGGCCCACCCCTTCTTTTGGAGCAGA 

GCCAAGCAACTCCAGTTCTTCCAGGACGTCAGTGACTGGCTGGAGAAGGAGTCCGAGC^ 

GAGCCCCTGGTGAGGGCACTGGAGGCGGGAGGCTGCGCAGTGGTCCGGGACAACTGGCAC 

GAGCACATCTCCATGCCGCTGCAGACAGATCTGAGAAAGTTCCGGTCCTATAAGGGGACA 

TCAGTGCGAGACCTGCTCCGTGCTGTGAGGAAGAAGAAGCACCACTACAGGGAGCTCCCA 

GTTGAGGTGCGACAGGCACTCGGCCAAGTCCCTGATGGCTTCGTCCAGTACTTCACAAAC 

CGCTTCCCACGGCTGCTCCTCCACACGCACCGAGCCATGAGGAGCTGCGCCTCTGAGAGC 

CTCTTCCTGCCCTACTACCCGCCAGACTCAGAGGCCAGGAGGCCATGCCCTGGGGCCACA 

GGGAGGTGA 

SEQ ID NO: 79_HGP_6644466 

GGAGGGTTCGAATTGCAACGGCAGCTGCCGGGCGTATGTGTTGGTGCTAGAGGCAGCTGC 
AGGGTCTCGCTGGGGGCCGCTCGGGACCAATTTTGAAGAGGTACTTGGCCACGACTTATT 
TTCACCTCCGACCTTTCCTTCCAGGCGGTGAGACTCTGGACTGAGAGTGGCTTTCACAAT 
GGAAGGGATCAGTAATTTCAAGACACCAAGCAAATTATCAGAAAAAAAGAAATCTGTATT 
ATGTTCAACTCCAACTATAAATATCCCGGCCTCTCCGTTTATGCAGAAGCTTGGCTTTGG 
TACTGGGGTAAATGTGTACCTAATGAAAAGATCTCCAAGAGGTTTGTCTCATTCTCCTTG 
GGCTGTAAAAAAGATTAATCCTATATGTAATGATCATTATCGAAGTGTGTATCAAAAGAG 
ACTAATGGATGAAGCTAAGATTTTGAAAAGCCTTCATCATCCAAACATTGTTGGTTATCG 
TGCTTTTACTGAAGCCAATGATGGCAGTCTGTGTCTTGCTATGGAATATGGAGGTGAAAA 
GTCTCTAAATGACTTAATAGAAGAACGATATAAAGCCAGCCAAGATCCTTTTCCAGCAGC 
CATAATTTTAAAAGTTGCTTTGAATATGGCAAGAGGGTTAAAGTATCTGCACCAAGAAAA 
GAAACTGCTTCATGGAGACATAAAGTCTTCAAATGTTGTAATTAAAGGCGATTTTGAAAC
AATTAAAATCTGTGATGTAGGAGTCTCTCTACCACTGGATGAAAATATGACTGTGACTGA 
CCCTGAGGCTTGTTACATTGGCACAGAGCCATGGAAACCCAAAGAAGCTGTGGAGGAGAA 
TGGTGTTATTACTGACAAGGCAGACATATTTGCCTTTGGCCTTACTTTGTGGGAAATGAT 
GACTTTATCGATTCCACACATTAATCTTTCAAATGATGATGATGATGAAGATAAAACTTT 
TGATGAAAGTGATTTTGATGATGAAGCATACTATGCAGCGTTGGGAACTAGGCCACCTAT 
TAATATGGAAGAACTGGATGAATCATACCAGAAAGTAATTGAACTCTTCTCTGTATGCAC 
TAATGAAGACCCTAAAGATCGTCCTTCTGCTGCACACATTGTTGAAGCTCTGGAAACAGA
TGTCTAGTGATCATCTCAGCTGAAGTGTGGCTTGCGTAAATAACTGTTTATTCCAAAATA 
TTTACATAGTTACTATCAGTAGTTATTAGACTCTAAAATTGGCATATTTGAGGACCATAG 
TTTCTTGTTAACATATGGATAACTATTTCTAATATGAAATATGCTTATATTGGCTATT^G 
CACTTGGAATTGTACTGGGTTTTCTGTAAAGTTTTAGAAACTAGCTACATAAGTACTTTG 
ATACTGCTCATGCTGACTTAAAACACTAGCAGTAAAACGCTGTAAACTGTAACATTAAAT 
TGAATGACCATTACTTTTATTAATGATCTTTCTTAAATATTCTATATTTTAATGGATCTA 
CTGACATTAGCACTTTGTAGAGTACAAAATAAAGTCTACATTTGTTTAAAACACTGAACC 
TTTTGCTGATGTGTTTATCAAATGATAACTGGAAGCTGAGGAGAATATGCCTCAAAAAGA 
GTAGCTCCTTGGATACTTCAGACTCTGGTTACAGATTGTCTTGATCTCTTGGATCTCCTC 
AGATCTTTGGTTTTTGCTTTAATTTATTAAATGTATTTTCCATACTGAGTTTAAAATTTA 
TTAATTTGTACCTTAAGCATTTCCCAGCTGTGTAAAAACAATAAAACTCAAATAGGATGA 
TAAAGAATAAAGGACACTTTGGGTACCAGAAGGTGTCTCAGCATTATTTTATACTTC 

SEQ ID NO: 80_AA449542_M 

ATCTCCAAGAGGGTTGTCTCATTCTCCTTGGGCCGTGAAAAAGATAAGTCTTTTATGCGA 
TGATCATTATCGAACTGTGTATCAGAAGAGACTAACTGATGAAGCTAAGATTTTAAAAAA 
CCTTAATCACCCAAACATTATAGGATATCGTGCTTTTACTGAAGCCAGTGATGGTAGTCT 
GTGCCTTGCTATGGAGTATGGAGGTGAAAAGTCTCTGAATGACTTAATAGAAGAGCGGAA 
CAAAGACAGTGGAAGTCCTTTTCCAGCAGCTGTAATTCTCAGAGTTGCTTTGCACATGGC 
CAGAGGGCTAAAGTACCTGCACCAAGAAAAGAAGCTGCTTCATGGAGACATAAAGTCTTC 
AAATGTTGTAATTAAAGGTGATTTTGAAACAATTAAAATCTGTGATGTAGGAGTCTCTCT 
GCCATTGGATGAAAATATGACTGTGACTGATCCTGAGGCCTGTTATATTGGTACTGAGCC 
ATGGAAACCCAAGGAAGCGTTGGAAGAAAATGGCATCATTACTGACAAGGCAGATGTGTT 
TGCTTTTGGCCTTACTCTGTGGGAAATGATGACTTTATGTATTCCACACGTCAATCTTCC 
AGATGATGATGTTGATGAAGATGCAACCTTTGATGAGAGTGACTTCGATGATGAAGCATA 
TTATGCAGCTCTGGGGACAAGGCCATCCATCAACATGGAAGAGCTGGATGACTCCTACCA 
GAAGGCCATTGAACTCTTCTGTGTGTGCACTAATGAGGATCCTAAAGATCGCCCGTCTGC 
TGCACACATCGTTGAAGCTTTGGAACTAGATGGCCAATGTTGTGGTCTAAGCTCAAAGCA 
TTAACTTGTATGGGAACTGTTAACTAGATATATGTAGTTAATATAACTTATGGTAGCTAG 
ATTCTAGAAGTAGCTTTAACACTAGTGACCCCTGTCTAAGATGACTTAAGAATCAAGGGA 
CCATTGCTTTGTTACAGATCTTTTTAGATATTCTTGCTTCTTTAGTGGGTTACTAAAAAT 
TTCACTACGTACATGTGGTACAGATATCTGTCTGCTCATAGTGTCAGTCCTTCAGCTGGC 
CTGTCAGCCCATGCGCCCTGGGACTTGAGAAGAGTTCATAAACGTAGCTCCTAGGGTGTC 
TTGCCTCTCTACACTTAGCTTCTAATTTATTACTTTGTTTCTACTGATTGTGTCTTAAGT 
CTTTTAAAATAAATGTAAGAATAAACAATAAAAGACAGTTTTAGTACCAGG 

SEQ ID NO: 81_5R57_10_2_M TESK2_M 

GCTGCTGGACAGTGACTTGTATTTACCGTGGACTGTGAGAGTGAAACTGGCCTATGGCAT 
AGCAGTGGGCCTCAGCTACCTTCACTTCAAAGGCATTTTCCATCGGGACCTCACATCT^ 

GGTGTGTAGGCTTTGCTTTC

SEQ ID NO: 82_AA232253_H

ATGTCGTCTCTCGGTGCCTCCTTTGTGCAAATTAAATTTGATGACTTGGAGTTTTTTGAA 
AACTGCGGTGGAGGAAGTTTTGGGAGTGTTTATCGAGCCAAATGGATATCACAGGACAAG 
GAGGTGGCTGTAAAGAAGCTCCTCAAAATAGAGAAAGAGGCAGAAATACTCAGTGTCCTC 
AGTCACAGAAACATCATCCAGTTTTATGGAGTAATTCTTGAACCTCCCAACTATGGCATT 
GTCACAGAATATGCTTCTCTGGGATCACTCTATGATTACATTAACAGTAACAGAAGTGAG 
GAGATGGATATGGATCACATTATGACCTGGGCCACTGATGTAGCCAAAGGAATGCATTAT 
TTACATATGGAGGCTCCTGTCAAGGTGATTCACAGAGACCTCAAGTCAAGAAACGTTGTT 
ATAGCTGCTGATGGAGTATTGAAGATCTGTGACTTTGGTGCCTCTCGGTTCCATAACCAT

ACAACACACATGTCCTTGGTTGGAACTTTCCCATGGATGGCTCCAGAAGTTATCCAGAGT 

CTCCCTGTGTCAGAAACTTGTGACACATATTCCTATGGTGTGGTTCTCTGGGAGATGCTA 

ACAAGGGAGGTCCCCTTTAAAGGTTTGGAAGGATTACAAGTAGCTTGGCTTGTAGTGGAA 

AAAAACGAGAGATTAACCATTCCAAGCAGTTGCCCCAGAAGTTTTGCTGAACTGTTACAT 

CAGTGTTGGGAAGCTGATGCCAAGAAACGGCCATCATTCAAGCAAAT^^ 

GAGTCCATGTCAAATGACACGAGCCTTCCTGACAAGTGTAACTCATTCCTACACT^CA^ 

GCGGAGTGGAGGTGCGAAATTGAGGGAACTCTTGAGAGGCTAAAGAAACTAGAGCGTGAT 

CTCAGCTTTAAGGAGCAGGAGCTTAAAGAACGAGAAAGACGTTTAAAGATGTGGGAGCAA 

AAGCTGACAGAGCAGTCCAACACCCCGCTGCTGCCTTCCTTTGAGATTGGTGCATGGACG 

GAAGACGATGTGTATTGGTGGGTTCAGCAGCTCGTCAGAAAAGGTGACTCTTCAGCAGAG 

ATGAGTGTATATGCAAGCTTGTTTAAAGAAAACAACATTACAGGGAAGCGGCTGCTGCTG 

CTGGAGGAAGAAGACCTGAAAGACATGGGCATTGTCTCCAAGGGGCATATCATTCACTTC 

AAGTCAGCCATTGAGAAATTAACCCATGATTACATAAATTTGTTTCACTTCCCACCACTA 

ATTAAGGACTCAGGAGGTGAACCTGAAGAAAATGAGGAAAAAATAGTGAACCTGGAACTG 

GTTTTTGGTTTTCACTTGAAACCAGGAACTGGCCCACAGGATTGTAAGTGGAAAATGTAT 

ATGGAGATGGATGGGGATGAAATTGCAATAACCTACATAAAAGATGTGACATTCAACACT 

AACCTACCTGATGCGGAGATTTTAAAGATGACAAAGCCACCATTTGTAATGGAGAAGTGG 

ATTGTAGGAATAGCAAAAAGTCAGACTGTGGAGTGCACTGTCACATATGAGAGTGATGTT 

AGAACTCCAAAAAGCACTAAACATGTCCATTTGATTCAGTGGAGTAGAACAAAACCTCAG 

GATGAAGTGAAAGCAGTCCT^CTTGCCATTCAGACATTATTCACCAATTCAGATGGC;^ 

CCTGGAAGCAGGTCCGACTCAAGTGCTGATTGCCAGTGGTTAGATACTCTGAGGATGCGG 

CAGATTGCATCCAACT^CTTCTTTACAGCGTTCCCAGAGCT^TCCTATTCTGGGGTC^ 

TTCTTCTCACACTTTGATGGCCAGGATTCCTACGCTGCTGCTGTGAGACGGCCCCAGGTG 

CCCATTAAGTATCAACAGATTACACCTGTGAACCAGTCCAGAAGCTCGTCTCCTACTCAG 

TATGGACTGACCAAAAACTTCTCTTCCTTACATCTCAACTCTAGGGACAGTGGCTTT 

AGTGGCTATACTGACACCTCTTCAGAGAGGGGTCGATACTCAGACAGAAGCAG^^ 

TATGGACGTGGTAGTATATCACTCAATTCTTCTCCTAGAGGAAGATACAGTGGAAAGAGT 

CAGCATTCCACTCCATCAAGAGGAAGATACCCTGGAAAGTTCTACAGGGTTTCTCAGTCA 

GCACTCAATCCTCACCAGTCGCCTGACTTCAAGAGAAGCCCCAGGGACCTCCACCAACCC 

AACACCATACCAGGGATGCCTTTGCACCCTGAGACTGACTCAAGAGCCAGTGAAGAGGAC 

AGCT^GTCAGCGAAGGGGGCTGGACAAAAGTGGAATACCGGAAAAAGCCCCACAGGCCA 

TCTCCCGCCAAAACCAATAAAGAGAGAGCCAGAGGGGACCACCGTGGATGGAGAAACTTT 

TGA 

SEQ ID NO: 83_AI375137_H 

ATGGGAAATTATAAATCTAGACCAACCCAAACTTGTACTGATGAATGGAAGAAAAAAGTC 

AGTGAATCATATGTTATCACAATAGAAAGATTAGAAGATGACCTGCAGATCAAGGAAAAA 

GAACTGACAGAACTAAGGAATATATTTGGCTCTGATGAAGCCTTCAGTAAAGTCAATTTA 

AATTACCGCACTGAAAATGGGCTGTCTCTACTTCATTTATGTTGCATTTGTGGAGGCAAG 

AAATCACATATTCGAACTCTTATGTTGAAAGGGCTCCGCCCATCTCGACTGACAAGAAAT 

GGATTTACAGCCTTGCATTTAGCAGTTTACAAGGATAATGCAGAATTGATCACTTCTCTG 

CTTCACAGTGGAGCTGATATACAGCAGGTTGGATACGGTGGCCTCACTGCCCTCCATATT 

GCTACAATAGCTGGCCACCTAGAGGCTGCTGATGTGCTGTTGCAACATGGAGCTAATGTC 

AATATTCAAGATGCAGTTTTTTTCACTCCATTGCATATTGCAGCGTACTATGGAC^^ 

CAGGTAACTCGCCTTCTTTTGAAATTTGGTGCTGATGTAAATGTAAGTGGTGAAGTTGGA 

GATAGACCCCTCCACCTAGCATCTGCAAAAGGATTCTTGAATATTGCAAAACTCTTGATG 

GAAGAAGGCAGCAAAGCT^GATGTGAATGCTCAAGATAATGAAGACCATGTCCCACTCCAT 

TTCTGTTCTCGATTTGGACACCATGATATAGTTAAGTATCTGCTGCAAAGTGATTTGGAA 

GTTCAACCTCATGTTGTTAATATCTATGGAGATACCCCCTTACACCTGGCATGCTACAAT 
GGCAAATTTGAAGTTGCCAAGGAAATCATCCAAATATCAGGAACAGAAAGTCT

GAAAACATCTTCAGTGAAACAGCTTTTCATAGTGCTTGTACCTATGGCAAGAGCATTGAC 

CTAGTCAAATTTCTTCTTGATCAGAATGTCATAAACATC?^CCACCAAGGAAGGGATG^ 

CACACTGGATTACACTCTGCTTGCTACCACGGTCACATTCGCCTGGTTCAGTTCTTACTG 

GATAATGGAGCTGATATGAATCTAGTGGCTTGTGATCCCAGCAGGTCTAGTGGTGAAAAA 

GATGAGCAGACATGTTTGATGTGGGCTTATGAAAAAGGGCATGATGCCATTGTCACACTC 

CTGAAGCATTATAAGAGACCACAAGATGAATTGCCCTGTAATGAATATTCTCAGCCTGGA 

GGAGATGGCTCCTATGTGTCTGTTCCATCACCCTTGGGGAAGATTAAAAGCATGACAAAA 

GAGAAGGCAGATATTCTCCTCCTT^GAGCTGGATTGCCTTCACATTTCCATCTTCAGCTC 

TCAGAAATTGAGTTCCATGAGATTATTGGCTCAGGTTCTTTTGGGAAAGTATATAAAGGA 

CGATGCAGAAATAAAATAGTGGCTATAAAACGTTATCGAGCCAATACCTACTGCTCCAAG 

TCAGATGTGGATATGTTTTGCCGAGAGGTGTCCATTCTCTGCCAGCTCAATCATCCCTGC 

GTAATTCAGTTTGTGGGTGCTTGCTTGAATGATCCCAGCCAGTTTGCCATTGTC^ 

TACATATCAGGGGGTTCTCTGTTCTCCCTCCTTGATGAGCAGAAGAGGATTCTTGATTTG 

CAGTCTAAATTAATTATTGCAGTAGATGTTGCCAAAGGCATGGAGTACCTTCACT^CCTG 

ACACAGCCAATTATACATCGTGACTTGAACAGTCACAATATTCTTCTCTATGAGGATGGG 

CATGCTGTGGTGGCAGATTTTGGAGT^TCAAGATTTCTACy^GTCTCTGGATGAAGAC^ 

ATGACAAAACAACCTGGGAACCTCCGTTGGATGGCTCCTGAGGTGTTCACGCAGTGCACT 

CGGTACACCATCAAAGCAGATGTCTTCAGCTATGCTCTGTGTCTGTGGGAAATTCTCACT 

GGCGAAATTCCATTCGCTCATCTCAAGCCAGCGGCTGCGGCAGCAGACATGGCTTACCAC 

CACATCAGACCTCCCATTGGCTATTCCATTCCCAAGCCCATATCATCTCTGCTGATACGA 

GGGTGGAACGCATGTCCTGAAGGAAGACCCGAATTTTCTGAAGTTGTCATGAAGTTAGAA 

GAGTGTCTCTGCAACATTGAGCTGATGTCTCCTGCATCAAGTAACAGCAGTGGGTCTCTC 

TCACCTTCTTCTTCTTCTGATTGCCTGGTGAACCGGGGAGGACCTGGCCGGAGTCATGTG 

GCAGCATTAAGAAGTCGTTTCGAATTGGAATATGCTCTAAATGCAAGGTCCTATGCTGCT 

TTGTCCCAAAGTGCTGGACAATATTCCTCTCAAGGTCTGTCTTTGGAGGAGATGAAAAGA 

AGTCTTCAATACACACCCATTGACAAATATGGCTATGTATCCGATCCCATGAGCTCAATG 

CATTTTCATTCTTGCCGAAATAGTAGCAGCTTTGAGGACAGCAGCTGA 

SEQ ID NO: 84_H97685_H 

ATGATTTCTTGCCTGTNATAACCTATGCACTCACAAAGATGAACTCTCTGAGAGGGATGA 
GCAAGAGCTTCAGGAAATCCGAAAGTATTTCTCCTTTCCTGTATTCTTTTTCAAAGTGCC 
GAAACTGGGCTCGGAGATAATAGACTCCTCAACCAGGAGAATGGAGAGCGAAAGATCACC 
GCTTTATCGCCAGCTAATTGACCTGGGCTATCTGAGCAGCAGTCACTGGAACTGTGGGGC 
TCCTGGCCAGGATACTAAAGCTCAGAGCATGTTGGTGGAACAGAGTGAAAAGCTGAGACA 
CTTGAGCACATTTTCTCACCAGGTGTTACAGACTCGCCTGGTGGATGCAGCCAAGGCCCT 
GAACCTGGTGCACTGCCACTGCCTTGACATCTTTATTAACCAGGCATTTGACATGCAGCG 
GGACCTGCAGATCACTCCCAAACGTCTGGAATATACTCGAAAAAAGGAGAATGAGTTGTA 
TGAATCATTGATGAATATTGCCAACCGAAAGCAGGAGGAAATGAAGGATATGATTGTTGA 
GACACTTAATACCATGAAGGAGGAACTTCTGGATGATGCTACTAACATGGAGTTTAAAGA 
CGTCATTGTCCCTGAGAATGGAGAACCAGTAGGCACCAGAGAGATCAAATGCTGCATCCG
ACAGATCCAGGAACTCATCATCTCCCGACTTAATCAGGCAGTGGCTAATAAGCTGATCAG 
CTCAGTGGATTACCTGAGGGAAAGCTTCGTCGGAACCCTGGAACGATGTCTGCAGAGCCT 
GGAGAAGTCTCAGGATGTCTCAGTTCACATCACCAGTAATTATCTCAAACAGATCTTAAA
TGCTGCCTATCATGTTGAAGTCACGTTTCACTCAGGGTCGTCAGTTACAAGGATGCTATG 
GGAGCAAATCAAACAGATCATCCAGCGCATCACATGGGTGAGCCCACCTGCCATCACTCT 
CkSAATGGAAGAGGAAGGTGGCCCAGGAAGCCATTGAGAGCCTCAGCGCCTCCAAATTGGC 
TAAGAGCATTTGCAGCa^TTCCGGACTCGGCTCAATAGTTCCCACGAGGCTTTTGCAGC 
CTCCTTGCGGCAGCTGGAAGCTGGCCACTCAGGCCGGTTAGAGAAAACGGAAGATCTATG 
GCTGAGGGTTCGGAAAGATCATGCTCCCCGCCTGGCCCGCCTTTCTCTGGAAAGCCGTTC 
TTTACAGGATGTCTTGCTTCATCGTAAACCTAAACTGGGACAGGAACTGGGCCGGGGCCA
GTATGGTGTGGTATACCTGTGTGACAACTGGGGAGGACACTTCCCTTGTGCCCTCAAATC 
AGTTGrCCCTCCAGATGAGAAGCACTGGAATGATCTGGCTTTGGAATTTCACTATATGAG 
GTCTCTGCCGAAGCATGAGCGArTGGTGGATCTCCATGGTTCAGTCATTGACTACAACTA 
TGGTGGTGGCTCCAGCATTGCTGTGCTCCTCATTATGGAGCGGCTACACCGGGATCTCTA 
CACAGGGCTGAAGGCTGGGCTGACCCTGGAGACACGTTTGCAGATAGCACTAGATGTGGT 
GGAGGGAATCCGCTTCCTGCACAGCCAGGGACTTGTCCATCGTGATATCAAACTGAAAAA 
TGTGCTGCTGGATAAGCAGAACCGTGCCAAGATCACTGACTTAGGATTCTGCAAGCCAGA 
GGCCATGATGTCAGGCAGCATTGTGGGGACACCT^TCCATATGGCCCCTGAACTTTTC^^ 
AGGGAAGTACGATAATTCCGTGGATGTCTACGCTTTTGGAATTCTTTTCTGGTATATCTG 
CTCAGGCTCTGTCAAGCTCCCTGAGGCATTTGAGAGGTGTGCTAGCAAAGACCATCTCTG 
GAACAATGTGCGGAGGGGGGCTCGCCCAGAACGTCTTCCTGTGTTTGATGAGGAGTGCTG 
GCAGTTGATGGAAGCCTGTTGGGATGGCGACCCCTTGAAGAGGCCTCTCTTGGGCATTGT 
CCAGCCCATGCTCCAGGGCATCATGAATCGGCTCTGCAAGTCCAATTCTGAGCAGCCAAA 
CAGAGGACTAGATGATTCTACTTGAAAGCAAAGACCTTTCTCTTTCACTCTCTAGTTATT 
TCCTTCCCCCTCACCATTTGGCCATGGGGAGAATTTGACATTTATTCACTATAGGACACA 
CTCCCAAGGGAACTGGTGCTTGCTGGGAAACTTGGAACCTTCCCAGGCAGGGATGACTCC 
TGGACyVGTGAAGAGTTGAATGACTGAGCATATTCAGCAGCTCACTGAAGCGCCAAGCTAT 
CCCTTTAGCAAAAAAGTGTCTCAGATGTGTAAAAGCTGAGGAATGTGGTGTTCTGGCTTC 
ACAAATGAAAAGGAGGCAGATGTT 

SEQ ID NO: 85_W20810_M

TTGATGTCAACCTGAAGGCTTCTAAAGCGAGTGATGTCTACAGCTTTGGGATCCTCGTGT 

GGGCAGTGCTGGCTGGCAGAGAAGCTGAGTTGGTAGACAAGACTTCACTAATCCGGGAAA 

CAGTGTGTGACAGGCAGAGTCGTCCTCCACTGACAGAGCTGCCTCCAGGTAGCCCTGAGA 

CTCCCGGCTTGGAAAAACTGAAGGAGTTAATGATTCATTGCTGGGGTTCCCAGTCCGAAA 

ACAGGCCATCCTTCCAGGACTGCGAACCAAAAACCAATGAAGTTTACAATCTGGTAAAGG 

ACAAGGTAGATGCTGCTGTCTCCGAGGTAAAGCATTATCTGTCTCAGCACAGAAGCTIGCG 

GCAGAAACTTGTCTGCCAGAGAGCCAAGCCAAAGAGGCACAGAAATGGATTGCCCGAGGG 

AAACCATGGTTTCTAAAATGCTGGACCGCCTGCATTTGGAGGAACCCTCCGGACCAGTTC 

CTGGAAAATGTCCTGAGAGGCAAGCACAGGACACATCAGTTGGGCCTGCCACACCAGCAA 

GGACATCTTCTGACCCCGTGGCTGGCACTCCTCAGATTCCACATACTTTACCCTTCAGAG 

GCACT^CACCTGGGCCAGTCTTTACTGAGACTCCCGGTCCTCACCCCCAAAGGAATC^ 

GAGATGGAAGACACGGCACTCCTTGGTATCCCTGGACCCCACCGAATCCAATGACAGGGC 

Cy^CCGGCTCTCGTCTTCAACAACTGTTCTGAAGTGCT^GATTGGGAACTACAACTCCTTGG 

TAGCACCACCAAGAACTACTGCCTCAAGTTCGGCCAAGTATGACCAAGCACAGTTCGGC^ 

GGGGTAGGGGCTGGCAGCCCTTCCACAAGTAGACTTCTIGAGAATCACTGCAA^ 

AGTGTGCCATTCAGCGTGGCAATAAAAAGCACGTTTTAAGCAACCTGGACTGGCTAAGAC 

AGTCCTTGCCACTTCCTGAAGCTCACAACATTCTGTGAGGACAGTTGGACCTACACCCAA 

ACTGACTCTTGACCCATCTCCTTAAAGTCAATAAACATAGCATGTTAACTGTG 

SEQ ID NO: 86_AA744236_H 

ATGGGATCAGAGAACAGTGCTTTAAAGAGCTATACACTGAGAGAACCACCATTTACCTTA 
CCCTCTGGACTTGCTGTTTATCCCGCTGTACTGCAAGATGGCAAATTTGCTTCAGTTTTT 
GTGTATAAGAGAGAAAATGAAGACAAGGTTAATAAAGCTGCCAAGCATTTGAAGACACTT 
CGTCACCCTTGCTTGCTAAGATTTTTATCTTGTACTGTGGAAGCGGATGGCATTCATCTT 
GTCACTGAGCGAGTACAGCCCCTGGAAGTGGCTTTGGAAACATTGTCTTCTGCAGAGGTC 
TGTGCTGGGATCTATGACATATTGCTGGCTCTTATCTTCCTTCATGACAGAGGACACCTA 
ACACACAATAATGTCTGTTTATCATCTGTGTTTGTGAGTGAAGATGGACACTGGAAGCTA 
GGAGGAATGGAAACTGTTTGTAAAGTTTCTCAGGCCACACCAGAGTTTCTGAGGAGTATT 
CAGTCAATAAGAGACCCAGCATCTATCCCTCCTGAAGAGATGTCTCCAGAATTCACAACT

CTCCCAGAGTGTCATGGACATGCCCGGGATGCCTTTTCATTTGGAACATTGGTGGAAAGT 

TTGCTCACAATCTTAAATGAACT^GGTTTCAGCGGATGTTCTCTCCAGCTTTC^ 

TTGCACTCAACTTTGCTGAATCCCATTCCAAAATGTCGGCCAGCGCTCTGCACCTTACTA 

TCTCTITGACTTCTTCAGAAATGATTTTCTGGAAGTTGTGAATTTCTTGAA?^ 

TTGAAGAGTGAAGAGGAGAAAACGGAATTCTTTAAATTTCTGCTGGACAGAGTCAGCTGC 

TTGTCAGAGGAATTGATAGCTTCAAGGTTGGTGCCTCTTCTGCTTAATCAGTTGGTGTTT 

GCAGAGCCAGTGGCTGTTAAGAGTTTTCTTCCTTATCTGCTTGGCCCCAAAAT^GATCAT 

GCGCAGGGAGAAACTCCTTGCTTGCTCTCACCAGCCCTGTTCCAGTCACGGGTGATCCCC 

GTGCTTCTCCAGTTGTTTGAAGTTCATGAAGAGCATGTGCGGATGGTGCTGCTGTCTCAC 

ATCGAGGCCTACGTGGAGCACTTCTVCTCAGGAGCAGCTGAAGAAAGTCATCTTGCCA^ 

GTTTTGCTGGGCCTGCGTGATACTAGCGATTCCATTGTGGCAATrACTCTGCATAGCCTA 

GCAGTGCTGGTCTCTCTGGTTGGACCAGAGGTGGTTGTGGGAGGAGAACGAACCAAGATC 

TTCAAACGCACTGCCCGAAGXTTTACTAAAAATACTGACCTTTCrCTAGAAGGCGATCCA

TTTTCTCAGCCTATTAAATTTCCCT^TAAATGGACTCTCAGATGTAAAAAATACTTCGGAG 

GACAGTGAAAACTTCCCATCAAGTTCTAAAAAGTCTGAGGAGTGGCCTGACTGGAGTGAA 

CCTGAGGAGCCTGAAAATCAAACTGTCAACATACAGATTTGGCCTAGAGAACCTTGTGAT 

GATGTCAAGTCCCAGTGCACTACCTTGGATGTGGAAGAGTCATCTTGGGATGACTGCGAG 

CCCAGCAGCTTAGATACTAAAGTAAACCCAGGAGGTGGAATCACrGCTACAAAACCTGTT 

ACCTCAGGGGAGCAGAAGCCTATTCCTGCTTTGCTTTCACTCACrGAAGAGTCTATGCCT 

TGGAAATCAAGCTTACCCCAAAAGATTAGCCTTGTACAAAGGGGGGATGACGCAGACCAA 

ATCGAGCCGCC/U^GTGTCATCACAAGAAAGGCCCCTTAAGGTTCCATCAGAACTTGGT 

TTAGGAGAGGAATTCACCATTCAAGTAAAAAAGAAGCCAGTAAAAGATCCTGAGATGGAT 

TGGTTTGCTGATATGATCCCAGAAATTAAGCCTTCTGCTGCTTTTCTTATATTACCTGAA 

CTGAGGACAGAAATGGTCCCAAAAAAGGATGATGTCTCCCCAGTGATGCAGTTTTCCTCA 

AAATTTGCTGCAGCAGAAATTACTGAGGGAGAGGCTGAAGGCTGGGAAGAAGAAGGGGAG 

CTGAACTGGGAAGATAATAACTGGTGA 

SEQ ID NO: 87_AI052250_H 

AGCGGCCGCGGGGGCGGCGGAGGATATGGAGTAAAGCCAGAGTCAGTGGCC7VGGCACGAA 

GGCAGAGCAGGAACAGCCAGGAGGCGTTTATTAGGGGGGCGGGGGGAAAGAGCCCCAGCA 

CCGCCCCTCCTGGAAGAAGGAAGAGGTAACTATAACTACCCAATATTGCAGCCATGGAGT 

COVTGCTTAATAAATTGAAGAGTACTGTTACAAAAGTCACAGCTGATGTCACTAGTGCGG 

TAATGGGAATTCCTGTCACTAGAGAATTTGATGTTGGTCGACACATTGCCAGTGGTTGCA 

ATGGGCTAGCTTGGAAGATTTTTAATGGCAO^W^AAGTCAACAAAGCAGGAAGTGGCAG 

TTTTTGTCTTTGATAAAAAACTGATTGACAAGTATCAAAAATTTGAAAAGGATCAAATCA 

TTGATTCTCTAAAACGAGGAGTCCAACAGTTAACTCGGCTTCGACACCCTCGACTTCTTA 

CTGTCCAGCATCCTTTAGAAGAATCCAGGGATTGCTTGGCATTTTGTACAGAACCAGTTT 

TTGCCAGTTTAGCCAATGTTCTTGGTAACrGGGAAAATCTACCTTCCCCTArATCTCCAG 

ACATTAAGGATTATAAACTTTATGATGTAGAAACCAAATATGGTTTGCTTCAGGTTTCTG 

AAGGATTGTCATTCTTGCATAGCAGTGTGAAAATGGTGCATGGAAATATCACTCCTGAAA 

ATATAATTTTGAATAAAAGTGGAGCCTGGAAAATAATGGGTTTTGATTTTTGTGTATCAT 

CAACCT^TCCTTCTGAACAAGAGCCTAAATTTCCTTGTAAAGAATGGGACCCAAATTT^ 

CTTCATTGTGTCTTCCAAATCCTGAATATrTGGCTCCTGAATACATACTTTCTGTGAGCT 

GTGAAACAGCCAGTGATATGTATTCTTTAGGAACTGTTATGTATGCTGTATTTAATAAAG 

GGAAACCTATATTTGAAGTCAACAAGCTUVGATATTTACTVAGAGTTTCAGTAGGC^ 

ATCAGTTGAGTCGTTTAGGATCTAGTTCACTTACAAATATACCTGAGGAAGTTCGTGAAC 

ATGTAAAGCTACTGTTAAATGTAACTCCGACTGTAAGACCAGATGCAGATCAAATGACAA 

AGATTCCCTTCTTTGATGATGTTGGTGCAGTAACACTGCAATATTTTGATACCTTATTCC 

AAAGAGATAATCTTCAGAAATCACAGTTTTTCT^GGACTGCCAAAGGTTCTACCAAAAC 
TGCCCAAGCGTGTCATTGTGCAGAGAATTTTGCCTTGTTTGACTTCAGAATTTGTAAACC
CTGACATGGTACCTTTTGTTTTGCCCAATGTTCTACTTATTGCTGAGGAATGCACCAAAG 
T^GAATATGTCAAATTAATTGTTCCTGAACTTGGCCCTGTGTTTAAGCAGCAGGAGCCAA 
TCCAGATTTTGTTAATTTTCCTACAAAAAATGGATTTGCTACTAACCAAAACCCCTCCTG 
ATGAGATAAAGAACAGTGTTCTACCCATGGTTTACAGAGCACTAGAAGCTCCTTCCATTC 
AGATCCAGGAGCTCTGTCTAAACATCATTCCAACCTTTGCAAATCTTATAGACTACCCAT 
CCATGAAAAACGCTTTGATACCAAGAATTAAAAATGCTTGCTACAAACATCTTCCCTTGC 
GGTTCGTGTAAATTCATTAAACAACATTGGAGCAGACCTTCTGACTGGCAGTGAGTCCG 

SEQ ID NO: 88_AA278842_H 

GGCGCGCCAGATATCACACGTGCCAAGGGGCTGGCTCAGCCCCGGCTCGGGCGGCCGGAG 
GACCCGGAGCTAAGGCGCCCGAACCCGCGGCGGCGGTGGGGACGATGTGGTTCTTTGCCC 
GGGACCCGGTCCGGGACTTTCCGTTCGAGCTCATCCCGGAGCCCCCAGAGGGCGGCCTGC 
CCGGGCCCTGGGCCCTGCACCGCGGCCGCAAGAAGGCCACAGGCAGCCCCGTGTCCATCT 
TCGTCTATGATGTGAAGCCTGGCGCGGAAGAGCAGACCCAGGTGGCCAAAGCTGCCTTCA 
AGCGCTTCAAAACTCTACGGCACCCCAACATCCTGGCTTACT^TCGATGGACTGGAGACAG 
AAAAATGCCTCCACGTCGTGACAGAGGCTGTGACCCCGTTGGGAATATACCTCAAGGCGA 
GAGTGGAGGCTGGTGGCCTGAAGGAGCTGGAGATCTCCTGGGGGCTACACCAGATCGTGA 
AAGCCCTCAGCTTCCTGGTCAACGACTGCAGCCTCATCCACAACAATGTCTGCATGGCCG 
CCGTGTTCGTGGACCGAGCTGGCGAGTGGAAGCTTGGGGGCCTGGACTACATGTATTCGG 
CCCAGGGCAACGGTGGGGGACGTCCCCGCAAGGGGATCCCCGAGCTTGAGCAGTATGACC 
CCCCGGAGTTGGCTGACAGCAGTGGCAGAGTGGTCAGAGAGAAGTGGTCAGCAGACATGT 
GGCGCTTGGGCTGCCTCATTTGGGAAGTCTTCAATGGGCCCCTACCTCGGGCAGCAGCCC 
TACGCAACCCTGGGAAGATCCCCAAAACGCTGGTGCCCCATTACTGTGAGCTGGTGGGAG 
CAAACCCCAAGGTGCGTCCCAACCCAGCCCGCTTCCTGCAGAACTGCCGGGCACCTGGTG 
GCTTCATGAGCAACCGCTTTGTAGAAACCAACCTCTTCCTGGAGGAGATTCAGATCAAAG 
AGCCAGCCGAGAAGCAAAAATTCTTCCAGGAGCTGAGCAAGAGCCTGGACGCATTCCCTG 
AGGATTTCTGTCGGCACAAGGTGCTGCCCCAGCTGCTGACCGCCTTCGAGTTCGGCAATG 
CTGGGGCCGTTGTCCTCACGCCCCTCTTCAAGGTGGGCAAGTTCCTGAGCGCTGAGGAGT 
ATCAGCAGAAGATCATCCCTGTGGTGGTCAAGATGTTCTCATCCACTGACCGGGCCATGC 
GCATCCGCCTCCTGCAGCAGATGGAGCAGTTCATCCAGTACCTTGACGAGCCAACAGTCA 
ACACCCAGATCTTCCCCCACGTCGTACATGGCTTCCTGGACACCAACCCTGCCATCCGGG 
AGCAGACGGTCAAGTCCATGCTGCTCCTGGCCCCAAAGCTGAACGAGGCCAACCTCAATG 
TGGAGCTGATGAAGCACTTTGCACGGCTACAGGCCAAGGATGAACAGGGCCCCATCCGCT 
GCAACACCACAGTCTGCCTGGGCAAAATCGGCTCCTACCTCAGTGCTAGCACCAGACACA 
GGGTCCTTACCTCTGCCTTCAGCCGAGCCACTAGGGACCCGTTTGCACCGTCCCGGGTTG 
CGGGTGTCCTGGGCTTTGCTGCCACCCACAACCTCTACTCAATGAACGACTGTGCCCAGA 
AGATCCTGCCTGTGCTCTGCGGTCTCACTGTAGATCCTGAGAAATCCGTGCGAGACCAGG 
CCTTCAAGGCCATTCGGAGCTTCCTGTCCAAATTGGAGTCTGTGTCGGAGGACCCGACCC 
AGCTGGAGGAAGTGGAGAAGGATGTCCATGCAGCCTCCAGCCCTGGCATGGGAGGAGCCG 
CAGCTAGCTGGGCAGGCTGGGCCGTGACCGGGGTCTCCTCACTCACCTCCAAGCTGATCC 
GTTCGCACCCAACCACTGCCCCAACAGAAACCAACATTCCCCAAAGACCCACGCCTGAAG 
GAGTTCCTGCCCCAGCCCCCACCCCTGTTCCTGCCACCCCTACAACCTCAGGCCACTGGG 
AGACGCAGGAGGAGGACAAGGACACAGCAGAGGACAGCAGCACTGCTGACAGATGGGACG 
ACGAAGACTGGGGCAGCCTGGAGCAGGAGGCCGAGTCTGTGCTGGCCCAGCAGGACGACT 
GGAGCACCGGGGGCCAAGTGAGCCGTGCTAGTCAGGTCAGCAACTCCGACCACAAATCCT 
CCAAATCCCCAGAGTCCGACTGGAGCAGCTGGGAAGCTGAGGGCTCCTGGGAACAGGGTT 
GGCAGGAGCCAAGCTCCCAGGAGCCACCTCCTGACGGTACACGGCTGGCCAGCGAGTATA 
ACTGGGGTGGCCCAGAGTCCAGCGACAAGGGCGACCCCTTCGCTACCCTGTCTGCACGTC 
CCAGCACCCAGCCGAGGCCAGACTCTTGGGGTGAGGACAACTGGGAGGGCCTCGAGACTG 
ACAGTCGACAGGTCAAGGCTGAGCTGGCCCGGAAGAAGCGCGAGGAGCGGCGGCGGGAGA

TGGAGGCCAAACGCGCCGAGAGGAAGGTGGCCAAGGGCCCCATGAAGCTGGGAGCCCGGA 

AGCTGGACTGAACCGTGGCGGTGGCCCTTCCCGGCTGCGGAGAGCCCGCCCCACAGATGT 

ATTTATTGTACT^CCATGTGAGCCCGGCCGGCCCAGCCAGGCCATCTCTICGTG 

ATCAGAGCCACAATAAATTCTATTTCAC 

SEQ ID NO: 89_AA599286_H 

ATGGCCTTCATGGAGAAGCCGCCAGCCGGCAAGGTGCTGCTGGACGACACGGTGCCGCTG 

ACAGCAGCCATCGAGGCGAGCCAGAGCCTGCAGTCCCACACGGAATATATTATTCGAGTG 

CAAGGAGGAATTTCTGTGGAAAACAGCTGGCAGATTGTTAGAAGATACAGTGACTTTGAT 

TTGCTTAACAACAGCTTACAGATTGCAGGCCTAAGTCTACCTCTTCCTCCCAAAAAATTG 

ATTGGTAACATGGATCGTGAATTCATAGCTGAAAGGCAGAAAGGTCTTCAGAACTATCTC 

AACGTGATCACAACAAATCATATCTTGTCTAATTGTGAGCTGGTTAAGAAGTTTTTAGAT 

CCAAACAACTATTCCGCAAACTATACTGAGATTGCCTTGCAACAGGTTTCCATGTTCTTC 

CGATCAGAGCCAAAGTGGGAGGTGGTGGAACCTTTGAAAGACATAGGTTGGAGAATAAGG 

AAGAAATATTTCTTGATGAAGATTAAAAATCAGCCAAAGGAACGGCTAGTGTTAAGCTGG 

GCTGACCTTGGCCCAGACAAGTATTTGTCAGATAAAGATTTTCAGTGTCTAATCAAACTT 

CTGCCTTCTTGTTTGCACCCTTACATCTATCGGGTTACCTTTGCCACAGCTAATGAATCC 

TCAGCGTTGCTAATTAGGATGTTTAACGAAAAGGGAACATTGAAGGATCTGATCTACAAG 

GCAAAACCAAAAGACCCATTTCTAAAGAAGTACTGCAACCCTAAGAAGATTCAGGGCCTG 

GAACTCCAGCAT^TAAAAACATATGGACGGCAAATATTAGAGGTACTGAAGTTTCTTCAT 

GACAAGGGATTCCCTTATGGGCATCTTCACGCCTCCAATGTGATGCTCGATGGGGACACT 

TGCCGGCTGCTGGACCITGAGAATTCCTTATTGGGCCTGCCTTCCTTCTACCGATCTTAT 

TTTTCACAATTCAGGAAAATCAATACATTGGAAAGTGTGGATGTCCACTGCTTT^ 

TTACTGTATGAAATGACTTATGGACGACCGCCAGACTCGGTGCCTGTGGACTCCTTCCCT 

CCTGCCCCGTCCATGGCTGTGGTGGCCGTGTTGGAGTCTACGCTGTCTTGTGAAGCCTGT 

AAAAATGGCATGCCTACCATCTCCCGGCTCTTACAGATGCCATTATTCAGCGATGTTTTA 

CTAACCACTTCTGAAAAACCACAGTTTAAGATCCCTACAAAGTTAAAAGAGGCATTGAGA 

ATTGCCAAAGAATGTATAGAGAAGAGACTAATTGAGGAACAGAAACAGATTCACCAGCAT 

CGAAGACTGACAAGAGCTCAGTCCCACCATGGATCTGAGGAGGAAAGAAAAAAAAGAAAG 

ATTTTAGCTCGAAAGAAGTCAAAACGATCTGCTCTTGAAAATAGTGAAGAGCATTCAGCG 

AAGTACAGCAACTCCAATAATTCAGCAGGATCTGGGGCCAGCTCACCTCTCACGTCCCCG 

TCATCGCCAACTCCT^CCCTCTACATCAGGGATATCTGCATTACCTCCACCTCCTCCACCT 

CCACCACCACCAGCAGCTCCCTTGCCTCCTGCGAGCACCGAGGCACCTGCCCAGCTCTCG 

TCTCAGGCTGTGAATGGCATGAGCCGAGGGGCCTTGCTCAGCTCCATCCAGAATTTCCAA 

AAAGGAACTTTGAGGAAAGCCAAACCTGTGATCACAGTGCTCCGAAGATCGGCTGAAGCT 

TCCTGTTTACACTTGGAGGGAAAAGTTCTTTTTTATTCCTACTCACCCCTACCCCCC7VAC 

TACCCTCTTCCTGGGAAAGTAATTGCTGAGCCAGTACAGCCACAAACAGTACTATTTTGC 

AGATGCTCATGTAAGCAGCTTTTCGAGAGAAATAATTCTTTAAGCAGAATAAAGTTAGGC 

TGGCATGCAAAAAAAAAAAAAAAAAAAAA 

SEQ ID NO: 90_AA425725_H 

ATGAGCGCCAGCACGGGCGGTGGTGGGGACAGCGGCGGCAGCGGCGGCAGTAGCAGCAGC 
TCACAGGCCTCCTGCGGGCCCGAGTCCTCGGGCTCCGAACTAGCCCTGGCCACACCGGTG 
CCTCAGATGCTGCAGGGCCTTCTGGGCTCCGACGACGAGGAACAGGAAGACCCCAAAGAC 
TACTGCAAGGGCGGCTACCACCCTGTGAAGATCGGCGACGTGTTCAATGGGCGGTACCAC 
GTGGTGCGCAAACTGGGCTGGGGCCACTTCTCCACCGTCTGGCTCTGCTGGGACATCCAG 
CGCAAGCGCTTTGTGGCCCTCAAAGTGGTGAAGAGTGCGGGGCATTACACGGAGACAGCT 
GTGGATGAGATCAAGCTCCTGAAATGTGTCCGGGACAGCGACCCCAGTGACCCCAAAAGA 
GAGACCATTGTCCAGCTCATTGATGACTTCAGGATCTCAGGAGTCAATGGAGTCCATGTG 
TGCATGGTGCTGGAGGTGCTGGGCCACCAGCTCCTCAAATGGATCATCAAGTCCAACTAC
CAGGGCCTGCCCGTGCCCTGCGTGAAGAGCATCGTGAGGCAGGTGCTGCACGGCCTGGAC 
TACCTCCACACCAAGTGCAAGATCATCCACACGGACATCAAGCCCGAGAACATCTTGCTG 
TGTGTGGGGGACGCTTACATCAGGCGCCTGGCTGCCGAGGCCACGGAGTGGCAACAGGCA 
GGGGCGCCGCCCCCCTCCCGCTCCATAGTCAGCACTGCCCCCCAGGAGGTCTTGACCGGT 
AAGCTGTCCAAAAACAAGAGGAAGAAGATGAGGCGCAAACGGAAACAGCAGAAGCGGCTG 
CTGGAGGAGCGGCTGCGGGACCTGCAGAGGCTGGAGGCCATGGAGGCTGCCACCCAGGCT 
GAGGACTCTGGCTTGAGACTAGACGGGGGCAGCGGCTCCACATCCTCTTCAGGCTTCTCC 
GGCTCCCTCTTCTCTCCTGCCTCCTGCTCCATCCTCTCCGGCTCGTCCAATCAGCGAGAG 
ACCGGGGGCCTCCTGTCGCCTAGCACACCATTCGGTGCCTCGAACCTCCTGGTGAACCCC 
CTGGAGCCCCAAAATGCAGATAAGATCAAGATCAAGATCGCAGACCTGGGCAACGCCTGC 
TGGGTGCACAAGCACTTCACGGAAGACATCCAGACTCGGCAGTACCGGGCCGTCGAGGTG 
CTGATCGGCGCCGAATACGGCCCCCCGGCAGACATCTGGAGCACAGCCTGCATGGCCTTC 
GAGCTGGCCACTGGTGACTACCTGTTCGAGCCGCATTCTGGAGAAGACTACAGTCGTGAT 
GAGGACCACATCGCTCACATAGTGGAGCTTCTGGGGGACATCCCCCCAGCCTTCGCCCTC 
TCAGGCCGCTATTCCCGGGAGTTCTTCAACCGGAGAGGAGAGCTGCGGCACATCCACAAT 
CTCAAGCACTGGGGCCTGTACGAGGTACTCATGGAAAAGTACGAGTGGCCCCTAGAGCAG 
GCCACACAGTTCAGCGCCTTTCTGCTGCCCATGATGGAGTACATCCCCGAAAAGCGGGCC 
AGTGCCGCTGACTGCCTCCAGCT^CCCCTGGCTCAACCCCTAG 

SEQ ID NO: 91_SGK022_H 

TCTGGCCCTGTCCCTCCCCACCACCCGCCGCTGTGTCCAGACAGAGAATGTTCTAACGCT 

GGGGGCGGCTGCGGATGAAGTCCTTGGGGAGAAAAGGAGCAGGCCAAGGGCGATGGTGGA 

GTAGAGCTGCCTCTCAGAGGCAGCATGAGCTGAGAGGGTGATAGGAAGGCGGCGCTAGAC 

AGCATGGAGGACTTTCTGCTCTCCT^TGGGTACCAGCTGGGCAAGACCATTGGGGAAGGG 

ACCTACTCAAAAGTCAAAGAAGCATTTTCCAAAAAACACCAAAGAAAAGTGGC^ 

GTTATAGACAAGATGGGAGGGCCATCAGAGTTTATCCAGAGATTCCTCCCTCGGGAGCTC 

CAAATCGTCCGTACCCTGGACCACAAGAACATCATCCAGGTGTATGAGATGCTGGAGTCT 

GCCGACGGGAAAATCTGCCTGGTGATGGAGCTCGCTGAGGGAGGGGATGTCTTTGACTGC 

GTGCTGAATGGGGGGCCACTGCCTGAAAGCCGGGCCAAGGCCCTCTTCCGTCAGATGGTT 

GAGGCCATCCGCTACTGCCATGGCTGTGGTGTGGCCCACCGGGACCTCAAATGTG^ 

GCCTTGTTGCAGGGCTTCAACCTGAAGCTGACTGACTTTGGCTTTGCCAAGGTGTTGCCC 

AAGTCACACCGGGAGCTGAGCCAGACCTTCTGCGGCAGTACAGCCTATGCTGCCCCCGAG 

GTGCTGCAGGGCATTCCCCACGATAGCAAAAAAGGTGATGTCTGGAGCATGGGTGTGGTC 

CTGTATGTCATGCTCTGTGCCAGCCTACCTTTTGACGACACAGACATCCCCAAGATGCTG 

TGGCAGCAGCAGAAGGGGGTGTCCTTCCCCACTCATCTGAGCATCTCGGCCGATTGCCAG 

GACCTGCTCAAGAGGCTCCTGGAACCCGATATGATCCTCCGGCCTTCAATTGAAGAAGTT 

AGTTGGCATCCATGGCTAGCAAGCACTTGATAAAAGCAATGGCAAGTGCTCTCCAATAAA 

GTAGGGGGAGAAAGCAAA

SEQ ID NO: 92_AA060026_M SGK022_M

CAGACGGAGAATGTTCTAGCCCTGGAGGCAGCTGTGAATGAAGTCCTTGGGGGGAAAAGA

AGCAGGCCGAGGGCGATGGTGGAGTAGAGCTGCCTCGCAGAGGCAGCATGAGCTGAGAGG

GTGACAAGAAGGAGGCGCTACACAGCATGGAGGACTTTCTACTCTCCAATGGGTATCAGC

TGGGCAAGACCATTGGGGAAGGGACCTACTCAAAAGTCAAAGAAGCATTTTCCAAAAA^

ATCAAAGAAAAGTGGCAATTAAAATTATAGACAAGATGGGAGGGCCAG;^

AGAGATTCCTGCCTCGTGAGCTCCAGATTGTCCGTACCCTGGACCACAAAAACATCATCC 

AGGTGTATGAGATGCTGGAGTCAGCAGATGGAAAAATCTACCTGGTGATGGAACTGGCTG
AGGGAGGGGATGTCTTTGACTGTGTGCTGAACGGAGGGCCACTTCCCGAGAGCCGGGCCA
AGGCCCTCTTCCGCCAGATGGTTGAGGCTATTCGCTATTGCCATGGCTGTGGCGTGGCCC
ACCGGGACCTTAAGTGTGAGAACGCCTTGTTGCAGGGCTTCAACCTGAAGCTGACCGACT

TTGGCTTTGCCAAGGTGCTACCCAAGTCACGCAGGGAGCTGAGCCAGACCTTCTGTGGCA 

GCACAGCCTATGCCGCCCCTGAGGTGCTACAGGGCATACCCCATGATAGCAAGAAAGGTG 

ATGTCTGGAGCATGGGTGTGGTCCTGTATGTAATGCTCTGTGCAAGTCTACCTTTTGATG 

ACACAGATATCCCCAAGATGCTGTGGCAGCTVGCAGAAGGGGGTGTCCTTCCCCACTC^^ 

TGGGCATCTCAACCGAATGCCAGGACCTGCTCAAGCGGCTCCTGGAACCAGACATGATAC 

TCCGGCCTTCAATCGAAGAAGTTAGTTGGCACCCATGGCTAGCAAGCACTTGATAAAAGC 

7VATGGCAAGTCCTCCCCAATAAAGTAGGGGGAGAAAGCAAACTG 

SEQ ID NO: 93_AA399669_H

CTCCCAAAGTGCTGGGATTACAGGCGTGAGCCACCGCGCCCGGCCGCACTTCATTCTCAA 

GTTTTGTGGCCAACGATGGATAGGAGGTGGATTGTGATGTATTCGGAACATGGGACCTTG 

AGGAGTTCCGTAACCAAAAGGAGT^GTAACAACAGCCAGTGGAGACAAAAAGAACTGCT 

TCTCTTTCTTTCCCCCTCCAAGTTCCTAGTGGAGGGCTGAGTCCAGCATCCCAGACTCGT 

GTGACTATATAGGCAAGCATTTGGGGACCTACTTCACTTTGATACCCTAGCCTTCAGCAG 

CTCAAGGTGTTGGCCTTTGGATAGGAGGCTTCCAAGTAGTAAAGCTCCCTGCTCTCAGCA 

AGCCCAACACCATGGGGAAGGGAGATGTCTTAGAGGCAGCACCAACCT^CCTl 

ATTCCCTCATGGATGAATATGGTTATGAGGTGGGCAAGGCCATTGGCCATGGCTCCTATG 

GGTCGGTATATGAGGCTTTCTACACAAAGCAGAAGGTTATGGTGGCAGTCAAGATCATCT 

CAAAGAAGAAGGCCrCTGATGACTATCTTAACAAGTTCCTGCCCCGTGAAATACAGGTAA 

TGAAAGTCTTGCGGCACAAGTACCTCATCAACTTCTATCGGGCCATTGAGAGCACATCTC 

GAGTATACATCATTCTGGAACTGGCTCAGGGTGGTGATGTCCTTGAATGGATCCAGCGCT 

ACGGGGCCTGCTCTGAGCCCCTTGCTGGCAAGTGGTTCTCCCAGCTGACCCTGGGCATTG 

CCTACCTGCACAGCAAGAGCATCGTGCACCGGGACTTAAAGTTGGAGAACCTGTTGCTGG 

ACAAGTGGGAGAATGTGAAGATATCAGACTTTGGCTTTGCCAAGATGGTGCCTTCTAACC 

AGCCTGTGGGTTGTAGCCCTKCTTACCGCCAAGTGAACTGCTTTTCCCACCTCAGCCAGA 

CTTACTGTGGCAGCTTTGCTTACGCTTGCCCAGAGATCTTACGAGGCTTGCCCTACAACC 

CTTTCCTGTCTGACACCTGGAGCATGGGCGTCATCCTTTACACTCTAGTGGTCGCCCATC 

TGCCCTTTGATGACACCAATCTCAAAAAGCTGCTT^GAGAGACTCAGAAGGAGGTCACTT 

TCCCAGCTAACCATACCATCTCCCAGGAGTGCAAGGTCCAACTGCTCATTGCCTGTGTGG 

CACAATGGAGAAAAACTCAGGCAAGACCTCTCTCTCCCCTGCTCTAGAACCTGATCCTCC 

AGATGCTACGCCAAGCCACTAAGCGTGCCACCATTCTGGACATCATCAAGGATTCCTGGG 

TGCTCAAGTTCCAGCCTGAGCAACCCACCCATGAGATCAGGCTGCTTGAGGCCATGTGCC 

AGCTCCACAACACCACTAAACAGCACCAATCCTTGCAAATTACGACCTGAAAATGGCTGA 

GGGAGGGGGCTAAGAGAGGAGCAAAGCAGGAGGTCTTGGGCTAAAAATCTTTTTTACCAA 

AAATAAATCTAAGTCTGATTTAGTTTCATCAAAAAAA 

SEQ ID NO: 94_AA758539_H 

GACCATTCAGACGCCTCCGGTAGTGTAAATGAGGACAATGCCTGCTGGCCCACATGACGG 
GGGGATGTAGACGGCAGCGGCGCCAGTCGCTCCTGGCACCATGGACGATGCCACAGTCCT 
AAGGAAGAAGGGTTACATCGTAGGCATCAATCTTGGCAAGGGTTCCTACGCAAAAGTCAA 
ATCTGCCTACTCTGAGCGCCTCAAGTTCAATGTGGCTGTCAAGATCATCGACCGCAGGAA 
AACACCTACTGACTTTGTGGAGAGATTCCTTCCTCGGGAGATGGACATCCTGGCAACTGT 
CAACCACGGCTCCATCATCAAGACTTACGAGATCTTTGAGACCTCTGACGGACGGATCTA 
CATCATCATGGAGCTTGGCGTCCAGGGCGACCTCCTCGAGTTCATCAAGTGCCAGGGAGC 
CCTGCATGAGGACGTGGCACGCAAGATGTTCCGACAGCTCTCCTCCGCCGTCAAGTACTG 
CCACGACCTGGACATCGTCCACCGGGACCTCAAGTGCGAGAACCTTCTCCTCGACAAGGA 
CTTCAACATCAAGCTGTCTGACTTTGGCTTCTCCAAGCGCTGCCTGCGGGACAGCAATGG 
GCGCATCATCCTCAGCAAGACCTTCTGCGGGTCGGCAGCATATGCAGCCCCCGAGGTGCT 
GCAGAGCATCCCCTACCAGCCCAAGGTGTATGACATCTGGAGCCTGGGCGTGATCCTGTA 
CATCATGGTCTGCGGCTCCATGCCCTATGACGACTCCGACATCAGGAAGATGCTGCGTAT

CCAGAAGGAGCACCGTGTGGACTTCCCGCGCTCCAAGAACCTGACCTGCGAGTGCAAGGA 

CCTCATCTACCGCATGCTGCAGCCCGACGTCAGCCAGCGGCTCCACATCGATGAGATCCT 

CT^GCCACTCGTGGCTGCAGCCCCCCAAGCCCAAAGCCACGTCTTCTGCCTCCTTCAA 

GGAGGGGGAGGGCAAGTACCGCGCTGAGTGCAAACTGGACACCAAGACAGGCTTGAGGCC 

CGACCACCGGCCCGACCACTUIGCTTGGAGCCAAAACCCAGCACCGGCTGCTGGTGGTGCC 

CGAGAACGAGAACAGGATGGAGGACAGGCTGGCCGAGACCTCCAGGGCCAAAGACCAT^ 

CATCTCCGGAGCTGAGGTGGGGAAAGCAAGCACCTAGCATGACAATGGCCCCGTTGTGTG 

TGGTGGGGGTCGGGGTTGGGGGGCATGGTGCAGTCGGCCTTCACGTAAACTAAGTAGGCA 

GGTAGGATCTGAAGAAGGCACAGGTGCAAGTAAAATTCGTCAATTAAACCACTATTTTGA 

TT 

SEQ ID NO: 95_AA883975_H 

ATGTCGGGAGACT^CTTCTGAGCGAACTCGGTTATAAGCTGGGCCGCACAATTGGAGAG 
GGCAGCTACTCCAAGGTGAAGGTGGCCACATCCAAGAAGTACT^GGGTACCGTGGCCATC 
AAGGTGGTGGACCGGCGGCGAGCGCCCCCGGACTTCGTCAACAAGTTCCTGCCGCGAGAG 
CTGTCCATCCTGCGGGGCGTGCGACACCCGCACATCGTGCACGTCTTCGAGTTCATCGAG 
GTGTGCAACGGGAAACTGTACATCGTGATGGAAGCGGCCGCCACCGACCTGCTGCAAGCC 
GTGCAGCGCAACGGGCGCATCCCCGGAGTTCAGGCGCGCGACCTCTTTGCGCAGATCGCC 
GGCGCCGTGCGCTACCTGCACGATCATCACCTGGTGCACCGCGACCTCAAGTGCGAAAAC 
GTGCTGCTGAGCCCGGACGAGCGCCGCGTCAAGCTCACCGACTTCGGCTTCGGCCGCCAG 
GCCCATGGCTACCCAGACCTGAGCACCTVCCTACTGCGGCTCAGCCGCCTACGCGTCACCC 
GAGGTGCTCCTGGGCATCCCCTACGACCCCAAGAAGTACGATGTGTGGAGCATGGGCGTC 
GTGCTCTACGTCATGGTCACCGGGTGCATGCCCTTCGACGACTCGGACATCGCCGGCCTG 
CCCCGGCGCCAGAAACGCGGCGTGCTCTATCCCGAAGGCCTCGAGCTGTCCGAGCGCTGC 
AAGGCCCTGATCGCCGAGCTGCTGCAGTTCAGCCCGTCCGCCAGGCCCTCCGCGGGCCAG 
GTAGCGCGCAACTGCTGGCTGCGCGCCGGGGACTCCGGCTAG 

SEQ ID NO: 96_AA905446_H 

CTGGTAGAGAACAGGGGCTGGTGCCAAGGCCCATGGAGATGAGAAAACGGAAGACAGGGA 

TCATGGAAAGAATTGTGGGGTCAGGGGACAGTGGCGGGAGGAGCTGGCTCACCACCCTGT 

GGACAAATCAGGCCTTATAATTTGTGATTCTGTGGCTTTGTCTAAAAGTCCATAAAGCAC 

CTTGATATCCAGTCTCACAGACTGCTCACAACAGTCCACAAGGCTGGTGGGGAGTGCTTC 

TTTTGAATGATATACTAACGACAAAAATAATAGAAGTGAACATTCTTTGCAATGTCC^ 

CAGCTAGACACACTTAAGACCATTAAGAAAGCCAAGAAATAAGACCCAGACAAGGTGGGC 

AGAAGTTGGAAGGCAGGAGACAGGTGTGAGGAGGTGGGCCTTTCTGATCTGCCAGCCCAT 

CTCTCCTCCCCTTACTTCCTCAGAGTTTATCCAGAGATTCCTCCCTCGGGAGCTCCAAAT 

CGTCCGTACCCTGGACCACAAGAACATCATCCAGGTGTATGAGATGCTGGAGTCTGCCGA 

CGGGAAAATCTGCCTGGTGATGGAGCTCGCTGAGGGAGGGGATGTCTTTGACTGCGTGCT 

GAATGGGGGGCCACTGCCTGAAAGCCGGGCCAAGGCCCTCTTCCGTCAGATGGTTGAGGC 

CATCCGCTACTGCCATGGCTGTGGTGTGGCCCACCGGGACCTCAAATGTGAGAACGCCTT 

GTTGCAGGGCTTCAACCTGAAGCTGACTGACTTTGGCTTTGCCAAGGTGTTGCCCAAGTC 

ACACCGGGAGCTGAGCCAGACCTTCTGCGGCAGTACAGCCTATGCTGCCCCCGAGGTGCT 

GCAGGGCy^TTCCCNNCAAGATGCTGTGGCTVGCAGCAGAAGGGGGTGTCCTTCCC 

TCTGAGCATCTCGGCCGATTGCCAGGACCTGCTCAAGAGGCTCCTGGAACCCGATATGAT 

CCTCCGGCCTTCAATTGAAGAAGTTAGTTGGCATCCATGGCTAGCAAGCACTTGATAAAA 

GCAATGGCAAGTGCTCTCCAATAAAGTAGGGGGAGAAAGCAAACCC

SEQ ID NO: 97_H29974_H

TTACAGCCTGTTGGCGGAGATCGGGCGCGGCAGCTACGGCGTGGTTTATGAGGCAGTGGC 

CGGGCGCAGCGGGGCCCGGGTGGCGGTCAAGAAGATCCGCTGCGACGCCCCCGAGAACGT 

GGAGCTGGCGCTGGCTGAATTCTGGGCCCTCACCAGCCTCAAGCGGCGCCACCAGAACGT 

CGTGCAGTTTGAGGAGTGCGTCCTGCAGCGCAATGGGTTAGCCCAGCGCATGAGTCACGG 

CAACAAGAGCTCGCAGCTTTACCTGCGCCTGGTGGAGACCTCGCTGAAAGGAGAAAGGAT 

CCTGGGTTATGCTGAGGAGCCCTGCTATCTCTGGTTTGTCATGGAGTTCTGTGAAGGTGG 

AGACCTGAATCAGTATGTCCTGTCCCGGAGGCCAGACCCAGCCACCAACAAAAGTTTCAT 

GCTACAGCTGACGAGCGCCATTGCCTTCCTGCACAAAAACCATATTGTGCACAGGGACCT 

GAAGCCAGACAACATCCTCATCACAGAGCGGTCTGGCACCCCCATCCTCAAAGTGGCCGA 

CTTTGGACTAAGCAAGGTCTGTGCTGGGCTGGCACCCCGAGGCAAAGAGGGCAATCA^ 

CAAOU^AAATGTGAATGTGAATAAGTACTGGCTGTCCTCAGCCTGCGGTTCGGACTTCTA 

CATGGCTCCTGAAGTCTGGGAGGGACACTACACAGCCAAGGCGGACATCTTTGCCCTGGG 

CATTATCATCTGGGCAATGATAGAAAGAATCACTTTTATTGACTCTGAGACCAAGAAGGA 

GCTCCTGGGGACCTACATTAAACAGGGGACTGAGATCGTCCCTGTTGGTGAGGCGCTGCT 

AGAAAACCCAAAGATGGAGTTGCACATCCCCCAAAAACGCAGGACTTCCATGTCTGAGGG 

GATCAAGCAGCTCTTGAAAGATATGTTAGCTGCTAACCCACAGGACCGGCCTGATGCCTT 

TGAACTTGAAACCAGAATGGACCAGGTCACATGTGCTGCTTAAAATTCAGGGCTAAGCAT 

TTTGGGTGATTTTAAACTAGGTCGATTCCTCGGGACCCACAGTCTCACCACGTCTCCTCC 

AGAGGACGGCAGAGGGTACAGGTGGTGGCCTGGCCGGTTGGCGATCTCCCGACAGCTGGA 

TCCGGCAATGTGAAGCTTTTGTTTGGGTTTCCCCGCTTCTTTTTAGTTTTGCTTTATTTN 

TNNCCTTTTCTTTTCTTTTTTTNTTTNCCACim^CCTTTTTTTAAATTTA;^ 

ACTTCAGAAGAGCAGGACACAATGCTGTGGACAGGCACCAATTTCTTTAAAGAAATTC 

TGTGGGCAAGGCATATGTGTAAATTTCACTTTTACTTTTTATAAGGGGTTAGGGAGCTAT 

TTTTGGTTTTGTCCTTCACTTTCCCTCTGTCTTCCTTCTTTATACTTTTCTCAGTTCTAC 

TTATGACACCTCACTTCCCTAGAGAAGGCCTGCCTCCCCATAGGGAATCTGGGGGTANCT 

TCTGGAACGGGGC6TGAGGANACAAGGAGCCTCTGGGCCACNCCTCCCTACCAGATGCAG 

GAACTCCTGGACTCCTTGGTGGGCTGGCCCTGGCTAGCCCTTGGGCCTCGGAGATGATCA 

GAGGTGAAGAACCGCC 

SEQ ID NO: 98_AA498104_M H29974_M 

CCGTTGCTGCTCCCCCCGCCCCCCGCAGCCATGGAAACGGGGAAAGAGAACGGAGCCCGC 

AGAGGGACAAAAAGCCCGGAGCGGAAAAGGCGAAGCCCAGTCCAGCGGGTACTGTGCGAG 

AAGCTGAGGCCGGCGGCCCAGGCCT^TGGATCCGGCTGGGGCCGAGGTCCCGGGCGAGGCC 

TTCCTGGCCCGGCGGCGGCCGGATGGCGGCGGCGGGGATGTTCCTGCACGGCCGCGCTAC 

AGCCTCTTGGCGGAGATCGGGCGCGGCAGCTACGGCGTGGTTTATGAGGCTGTGGCTGGG 

CGCAGTGGGGCCAGGGTGGCAGTCAAGAAGATCCGCTGCGACGCTCCCGAGAACGTGGAG 

TTGGCACTAGCAGAATTCTGGGCCCTCACCAGTCTCAAGCGGCGGCACCAGAATATCGTG 

CAGTTTGAGGAGTGCGTCCTACAGCGCAACGGGTTAGCCCAGCGCATGAGTCACGGCAAC 

AAGAACTCACAGCTTTACCTGCGCCTGGTGGAGACCTCGCTCAAAGGAGAAAGGATCCTG 

GGCTATGCTGAGGAGCCCTGCtATCTCTGGTTTGTCATGGAGTACTGTGAAGGTGGAGAC 

CTCAATCAGTATGTCCTGTCCCGGAGACCTGACCCAGCCACCAACAAAAGTTTCATGCTA 

CAGCTTACAAGCGCCATTGCCTTCCTGCATAAAAACCACATCGTGCACAGGGACCTAAAG 

CCAGACAACATCCTGATCACAGAGCGGTCTGGCACCCCCATCCTCAAGGTGGCAGACTTT 

GGACTGAGCAAGGTCTGTGCAGGGCTGGCACCCCGAGGCAAAGAGGGCAATCAAGATAAC 

AAAAATGTGAATGTGAATAAATACTGGCTGTCCTCAGCTTGTGGCTCAGACTTCTACATG* 

GCrCCCGAAGTCTGGGAGGGACACTATACAGCCAAGGCGGACATCTTTGCTCTGGGCATT 

ATCATCTGGGCAATGATAGAAAGAATrACCTTTATTGACTCTGAAACCAAGAAGGAGCTC 

CTGGGGACCTACATTAAGCAAGGGACTGAGATCGTCCCTGTTGGTGAGGCGCTGCTAGAA 

AACCCAAAGATGGAGTTGCATATCCCCCAGAAACGTAGGACTTCCATGTCTGAGGGGGTC 
AAGCAGCTCTTGAAAGACATGTTAGCTGCTAACCCACAGGACCGACCTGATGCTTTTGAA
CTTGAAACCCGAATGGACCAGGTCACATGTGCTGCTTAAACTCCAGGGCTGAACGTCTTG 
GGTGTTTTTAAACTAGGTCGATCCTTCGGGACCCACAGTCTCATCGTGTCTCGGACAGGA 
TGGCAGAGGGTACAGGTGGTGGTGATCTCCTGACAGCTGGACCTCCCACAATGTGAAGCT 
CACGCTTGGGCTGCCCACTCTACCCrTCTCTTTCTCCTTCAGTAGAATAATAATTGTTTT 
TCTAAACATTAAACCATCAAGACTTCTGAAGAGCAGAAGGCTACACTCTG 

SEQ ID NO: 99_AA215311_H

cgrccgcgctacggaaagccggaggggggcggggccgtcggcgtaagggggtgtgtccgc 

gcgcaccacgggggcgcgcgccggctgctgactggaggcggcggcagcggaggcgcgagc 

tgcccgataatggcggcctgc7vgagcccatgagagggagaagcggcagcgtctaccctga 

gaaacctcgaccttgaagatggtgagtagccagccaaagtacgatctaatacgggaggta 

ggccgaggtagttacggtgttgtgtatgaagcagtcatcagaaagacctctgcacgggtg 

gcagtgaagaaaattcgatgtcacgcacctgaaaatgttgaactagcccttcgtgagttc 

tgggcactaagcagtatcaagagccaacatccaaatgtgattcacttggagg;^ 

ctacaaaaggatgggatggtgcaaaagatgtcccacggctctaattcttccctttattta 

cagcttgtagaaacttcattaaaaggagaaattgcctttgatcccagaagcgcctattat 

ttgtggtttgtgatggatttttgtgacggaggagatatgaatgagtatctgttgtccagg 

aagcccaatcgtaaaactaacaccagcttcatgcttcagctgagcagtgccctggctttc 

ttgcataaaaaccagatcatccaccgagatcttaagcctgataacatcctgatttctcaa 

accaggttggataccagtgacttggaacctaccctcaaagtggctgattttggtctaagt 

aaagtttgttcagcctctgggcagaacccagaagaacctgtcagtgtaaacaagtgtt^ 

ctttccacagcatgtggaacagatttttacatggctcctgaagtttgggaaggacattac 

acagcaaaagctgacatctttgctctggggattatcatctgggcaatgctggaaagga 

ACATTCATAGACT^CAGAGACAAAGAAGGAACTCTTGGGGAGTTATGTAAAACAAGGAAC^ 

gagattgtgcctgttggggaggcacttctggaaaatcccaaaatggaacttctcattcct 
gtgaagaaaaaatctatgaatgggcgaatgaaacaactgattaaggaaatgctggctgca 
aaccctgaggatcgtccagatgcttttgaactagaactcagattagtacaaattgcattt 
aaagatagcagctgggaaacgtgacacatattatttgcaaatacgatggatgatatgctg 

CTTCTGTTTAACAGTGATGCAACATTATGTGGCTGAAAAAGAATATAAAAAGCTAGACTC 
TACCCTCTAAGGGTTTAGATTTTTTGTGGGATTTTTTTTTTCCTCT^TTTTTCTTAAATCC 
AAGTTGGCCGTTTTATTAGTATGTTTCAAATGTGTATTACCAATGTGGGTGTAAATTTTT 
AAAAAATGATTATTGATAGAAGTTTGGCT^GGAAAATTCTTTAAGAGCTAACAAGAGAAGA 
GAGTCCAGTTTTCTGGAAATATGTCTTTAAGTATTTTAGACATTCCTCGTCAGTATTAGG 

aatttccatgggaaaagaggtttgcatgctggtaatgctvacctttgaaactttgta 

aaacatatatgtatatatttatgtatatgtaagtatgtgaatgtgcgcattttgcattcc 

atatgaaaaaaatgccticgtctgtttaaattatttgatgtaggtttgggtttttgagatt 

TGCTGGTGAAGTCAGTGACGAAAAATAAACCTTCCCTTATCTTCCTACTCTGCCCCTCCC 

cctaatgaaatcatattaagtngtttttcc™ntttttttgtaatatacagcttttttt^ 

TAAGGCATC7VTTTTCGAGGGTCTAAAATTATCTGGTAAAACAAATGAAATTAAGTGATCC 

aaagctgctgaagtatgtttgaactctccagtgccctatagctgcaagagttgaattagt 

CATGCAGTCATATGGCAGCAGGTTGGTGATT 
SEQ ID NO: 100_AA018361_H 

GCGGGGCCTCCGTATCCCCACGTGGGCCCTGCAGGAACTGGCGGGGCGCGTGACCCGGCG 
AGGCCCAGAGACAGGGGAGGGGCGCCGGGAGCCGGGCGGATCCGCGTCCCCGATGCGCGC 
TGCATTTCCGGCGGGCGGCGCTGGGGGCAGCGTGGAGCCACCCAGTGCTCGGCCCGCCCC 
GCAACCCGCCGGAACCGCCGCCCGCAGCGAGGAAGCGCCCGCGCGGGCGCAGGCGGCCGG 
AATGGCGGGGCCCGGCTGGGGTCCCCCGCGCCTGGACGGCTTCATCCTCACCGAGCGCCT 
GGGCAGCGGCACGTACGCCACGGTGTACAAGGCCTACGCCAAGAAGGACACTCGTGAAGT 
GGTAGCCATAAAGTGTGTAGCCAAGAAAAGTCTGAACAAGGCATCGGTGGAGAACCTCC

CACGGAGATTGAGATCCTCAAGGGCATTCGACATCCCCACATTGTGCAGCTGAAAGACTT 

TCAGTGGGACAGTGACAATATCTACCTCATCATGGAGTTTTGCGCAGGGGGCGACCTGTC 

TCGCTTCATCCATACCCGCAGGATTCTGCCTGAGAAGGTGGCGCGTGTCTTCATGCAGCA 

ATTAGCTAGCGCCCTGCAATTCCTGCATGAACGGAATATCTCTCACCTGGATCTGAAGCC 

ACAGAACATTCTACTGAGCTCCTTGGAGAAGCCCCACCTAAAACTGGCAGACTTTGGTTT 

CGCACAACACATGTCCCCGTGGGATGAGAAGCACGTGCTCCGTGGCTCCCCCCTCTACAT 

GGCCCCCGAGATGGTGTGCCAGCGGCAGTATGACGCCCGCGTGGACCTCTGGTCCATGGG 

GGTCATCCTGTATGAAGCCCTCTTCGGGCAGCCCCCCTTTGCCTCCAGGTCGTTCTCGGA 

GCTGGAAGAGAAGATCCGTAGCAACCGGGTCATCGAGCTCCCCTTGCGGCCCCTGCTCTC 

CCGAGACTGCCGGGACCTACTGCAGCGGCTCCTGGAGCGGGACCCCAGCCGTCGCATCTC 

CTTCCAGGACTTCTTTGCGCACCCCTGGGTGGACCTGGAGCACATGCCCAGTGGGGAGAG 

TCTGGGGCGAGCAACCGCCCTGGTGGTGCAGGCTGTGAAGAAAGACCAGGAGGGGGATTC 

AGCAGCCGCCTTATCACTCTACTGCAAGGCTCTGGACTTCTTTGTACCTGCCCTGCACTA 

TGAAGTGGATGCCCAGCGGAAGGAGGCAATTAAGGCAAAGGTGGGGCAGTACGTGTCCCG 

GGCTGAGGAGCTCAAGGCCATCGTCTCCTCTTCCAATCAGGCCCTGCTGAGGCAGGGGAC 

CTCTGCCCGAGACCTGCTCAGAGAGATGGCCCGGGACAAGCCACGCCTCCTAGCTGCCCT 

GGAAGTGGCTTCAGCTGCCATGGCCAAGGAGGAGGCCGCCGGCGGGGAGCAGGATGCCCT 

GGACCTGTACCAGCACAGCCTGGGGGAGCTACTGCTGTTGCTGCGGAGCCCCCGGGCCGG 

AGGCGGGAGCTGCTTCACACTGAGGTTCAGAACCTCATGGCCCGAGCTGAATACTTGAAG 

GAGCAGATGAGGGAATCTCGCTGGGAAGCTGACACCCTGGACAAAGAGGGACTGTCGGAA 

TCTGTTCGTAGCTCTTGCACCCTTCAGTGACCCTAGAAGAATGATTGGACAGATGTGAGC 

CATCTGGAGCAGAGGGGCACTAACCCAGGCTGACGCCAAGAATGAAGTGGCCCACTGCAG 

CCCTGGCGAGCAGGCTTCTTGGATGGACAGTGCTGAGACCCCCATATCCCAGAGTCCCGA 

GCCTCCCTCAGGTTACTCTGCACCCCACAGATGGTTTGATG6CTGTGCTGTATACTGGAG 

GGGAGGGCAGGACTCTGGGAGAACAGCT^CTTCTTTCATGAGACCTTTGTTACTCGGTGGT 

TACTGGGTCCTGTGCCTGTCCGTTTTGGGGCATGCAGCCCTCTATCATTTTTGGCTCCGA 

GAAGAGGGCAAGGGGCCCCCGCAGGGTACTTCTGTGCTTGCCCTCGCCCTGCCAGCAGGC 

AGCTGTGCCCCTGGCCTGCCCTTCCCGGGACCCCTTATTCCAACTCAGCTCCTCTTTGCA 

CTGGAATGGGGCACTCCAACACCCCTCAGGGACCACCCTCCCCACAGTATGCT^CTC^^ 

CCACAGAACCCACCAGTCTTTCTGGGAACTCACACCTGCCCGCCATCTTGGTACTTTAGG 

TTAATCCCTGAAGCATGAAAGCTGGATCTTTTGGGGTTTAAGAAGCCCAAGCCTTGTTCC 

TGCCCTGGCCTAGGGAGCACTCAGGAGGGTTCCTTGGTCCTCATCTCTCCCACCTCCGTT 

CCCTCTGGGCCCCACACTAGCCACAGCGCGGGCCTTGTGCTGGAGTTTGAGCCTGGGACA 

GGGAGAGGGAGGCTTGGAGACAGTCTGACCCAGTGCCCTCTAGGCCACCCACTTCTAGGC 

CTGCCCTGCCGCCGTGGAGCCCTGGGCAAGCTCTTTCCCCTTTCTGGGCCTGGGTCTCCC 

CATCTCTTCAATGGGGCTGATACCTTCACAGCCCACAGCATGGGCACTTATGAGGACAAA 

GTGAATTTAACCTGGAAAAGAATGTATTTGAGAGTTTCTTTTAAATAATCAGCGGGTGTT 

GGTGATTTGTAGCCCTTCTGCCCTTAAATGCTTCCTTGGGCAAGAGCTGTCTGTCCTCCC 

TGCAGGAGGCTGAGTGTGAAGAGTATCATTCATTGTTTCTCTATTAAATTATTTTCTCT 

SEQ ID NO: 101_AA311714_H 

TGGACCTGTCCTGAGGCAGAGGCCGAGATGCGCGCAACCGCGGGAGCAGCCAAGTGGACT 
GGACTCTTTTCTTGACTTAGCTACCAGGAGCTAGAGATGCTGTTATTCrATCGTATGTGA 
GAAGTCGGCCCAGAGATGGAAAACTTTATTCTGTATGAGGAGATCGGAAGAGGAAGCAAG 
ACTGTTGTCTATAAAGGGCGACGGAAGGGAACAATCAATTTTGTAGCCATTCTTTGTACT 
GATAAGTGCAGAAGGCCTGAAATAACCAACTGGGTCCGTCTCACCCGTGAAATAAAACAC 
AAGAATATTGTAACTTTTCATGAATGGTATGAAACAAGCAACCACCTCTGGCTAGTGXAT 
GAAAACCTCCCAGAAGATGTTGTGAGAGAATTTGGAATTGACCTGATTAGTGGATTACAT 
CATCTTCATAAACTTGGCATTCTCTTTTGTGACATTTCTCCTAGGAAGATACTCTTGGAA 
GGGCCTGGCACACTGAAGTTTAGCAACTTTTGCTTGGCAAAAGTGGAAGGTGAAAATTTG

GAAGAGTTCTTTGCTTTGGTGGCAGCAGAGGAAGGAGGAGGTGATAATGGGGAAAATGTC 

CTGAAGAAAAGCATGAAAAGTAGAGTCAAAGGATCTCCTGTATATACAGCACCAGAAGTT 

GTGAGGGGTGCTGACTTTTCCATCTCCAGTGACCTCTGGTCTTTGGGCTGTCTGCTTTAT 

GAAATGTTTTCAGGAAAACCTCCATTCTTCTCAGAAAGTGTTTCAGAATTAACTGAAAAG 

ATCTTATGTGAAGATCCTTTGCCACCTATTCCGAAAGATTCTTCTCGTCCTAAAGCTTCT 

TCAGATTTTATTAATTTGCTTGATGGGTTACTTCAAAGAGATCCTCAGAAAAGATTGACT 

TGGACT^GGCTACTGCAGCATTCATTTTGGAAGAAAGCTTTTGCTGGAGCAGATCAGGAA 

TCAAGCGTCGAAGATCTCAGTCTCAGCAGAAACACTATGGAGTGTTCTGGGCCACAAGAT 

TCCAAGGAGCTTTTGCAGAACTCTCAGAGTAGACAAGCAAAAGGGCACAAGAGTGGTCAA 

CCACTAGGTCTICTCTTTCAGACTAGAAAATCCAACTGAGTTTCGGCCTAAGAGTACTCTT 

GAGGGTCAATTGAATGAATCCATGTTTCTTCTCAGTTCTCGTCCTACTCCCAGAACTAGC 

ACTGCAGTGGAAGTAAGTCCTGGTGAGGATATGACTCACTGTTCACCACAGAAGACTTCT 

CCTCTGACCAAGATTACAAGTGGACyVCCTGAGTCAGCAGGACCTGGAATCCCAGATGAGA 

GAGCTTATCTACACGGACTCAGATCTTGTTGTCACCCCCATTATCGACAATCCAAAGATA 

ATGAAACAGCCACCAGTTAAATTTGATGCAAAAATATTGCATCTACCAACATATTCAGT 

GATAAGTTATTATTTCTGAAAGATCTU^GATTGGAATGACTTTTTGCAACAAGTGTGCTCG 

CAGATCGACTCCACTGAGAAGAGCATGGGGGCCTCCCGA6CCAAGCTGAATCTCCTTTGC 

TATTTGTGCGTGGTGGCTGGTCACCAGGAGGTGGCCACCAGGCTCCTCCATTCCCCCCTG 

TTCCAATTGCTAATCCAGCATTTGCGGATAGCTCCT^CTGGGATATACGGGCC^ 

GCTCACGTGATTGGTTTACTGGCTTCGCACACAACTGAGCTCCAGGAAAATACACCTGTT 

GTTGAGACTACAAGCTCCATTGGAATCGGGATTTTGAACTGTCTTGTTCAACACTCCACT 

CCAGTGCCTAGACAGTGCCTTGTGTATGTATAGATACTGACAAATATTTCAAAATAAATA 

AAACTGTATCAGCATT 

SEQ ID NO: 102_SGK384_H 

TCTTTGGCCCACGTGCTGAGGGCGCGGCAGATCCTGACGGAGCCAGAAGTGCGCGACTAC 
CTGCGGGGCCTGGTCAGCGGCCTGCGCTACCTGCACCAGCGGTGCATCCTGCACCGC 

SEQ ID NO: 103_AA210451_M SGK384_M 

GGTCTGCTGCATGGATAATGGACTGGAACACAGAAAGACCATGCAGGGTTCGGCTGTAGA 
AGGCCAGTATCTCCAGAGGCCAGAAGACACCATCAGATCTCCTGGGACTGGAGTTATAGA 
GGlTGTGAGCTGCCATGTTGAACC7U\.GC:AGGTCACrrGAGGGACAC^ 

AACCCTGCTGGGAGAAAAAAGAAACTGCTGAAGGGACTGACATGGGACAGCAACATGGAA 
CCAGGAATGGTCTCACGCATAGAGAGCTCCCCCGGGGCGTGGGGCTGCTGCTCGCCATGG 
CCCTTATGAACGTGGCGCTCTACCTCTGCCTTGATCAGCTTTTCATCTCCCCTGGACGAT 
CCACCGCGGACTCTAGGCGCTGTCCTCCGGGCTACTTCAGAATGGGGCGGATGAGAAACT 
GCTCACGCTGGCTGTCCTGTGAAGAGCTGAGGACAGAAGTCAGGCAGCTGAAGCGCGTTG 
GGGAGGGAGCCGTGAAGAGAGTCTTTCTGTCTGAATGGAAGGAACACAAAGTCGCTCTCT 
CCCGGCTCACCAGGCTGGAGATGAAGGAGGACTTCCTGCATGGGCTGGAGATGCTGAAGT 
CTCTACAGAGTGAGCACGTGGTCACGCTGGTGGGCTACTGTGAGGAAGATGGCACTATTC 
TCACCGAATATCACCCCTTAGGTTCCTTGAGCAACCTGGAAGAAACACTAAACCTTTCAA 
AGTACCAAGACGTGAACACTTGGCAGCACAGGCTGCAGCTGGCCATGGAGTACGTCAGC^ 
TCATTAACTATCTGCATCACAGCCCCCTGGGCACGAGGGTCATGTGTGACTCTAACGACC 
TGCCCAAAACATTGTCCCAGTACCTGCTAACAAGTAACTTCAGCATTGTGGCAAACGACC 
TGGACGCTCTGCCCCTGGTAGACCATGACTCTGGGGTACTTATAAAGTGTGGCCACAGAG 
AGCTCCATGGGGATTTTGTGGCTCCAGAGCAGCTGTGGCCCTACGGAGAAGACACGCCCT 
TCCAAGACGATCTCATGCCTTCCTACAATGAGAAGGTTGACATCTGGAAGATTCCAGATG 
TCTCCAGTTTCCTCTTGGGGCACGTGGAAGGGAGTGATATGGTTAGATTCCATTTGTTTG 
ATATCCATAAGGCGTGCAAGAGCCAGATCCCGGCAGAAAGACCCACTGCTCAGAACGTGC 
TAGACGCTTACCAGAGGGTTTTCCTVTTCJ^CTCCGAGACACTGTGATGTCGCAGACGAAA

AAATGCTGTAAAAATGAGCCATCGAGTGACGTGCTTGATGGCTGAATGGCATCCCAGCTG 

TTCCGCTCTTGATGATGGAAGAGCTTTGCATGGATGGATGTTGACCCTGGCTGTTCAGCC 

ACGTAGGCCTCCTCTACGTCTGCCTGCATGTTTGAGTGTTCTGCTCTCCTGGCAGCCCGG 

ATGGAAGCTGCCAAGCGAGAAAGCCTGGCTTCAGGATGCTCCCTGGTGAAGATGCAGAGG 

ATTCTGGATCTGCATAGTTTCAAGGGAGTGATCAAACGGTGACCTTGAAGACATGCTGCC 

TGCCTTGGTAACTTTTTATAGACTAGTAGGAAACAGAAATCTTTTGGGGGAGGGGGGGAC 

AACCCACTAGTTCCTCAGAGACAATTTCTTCTCATTCAGAAAGCCCTGTTGGAAGCTGGG 

GATGTTTTAACTCCGTGGCAGGGCACTTGCCTAGTTGTGTGCAAAGCCTTGGATCTGACC 

CATGGCATGTGCACACACACAAATGCTCAAAGAAAATCCCAGACGCCAGAAGTGTGCCCC 

TTTCTTGTCAATAAGGTCATTGTCAGTACCGGAGATGATTTTTTTTATGAAGCGTTTATG 

CTGACTCGTGTCACTGAGCCAAGTGTGCATGGTCGTTAGCTACTTTGTGGGTTCTTCTTT 

CTTTCTACCCTACTTCTTCCCTTTCCACCCCCTAACACTAGATAGGAGAGAGGAGAGAGA 

AAGGAAAGTGGGCACTGTTATATTGTTGGACGACTTCTTGCTGATTAAGGGGTGTCGAGT 

TCCTTGGAGCAATGATCTTTGCTGCCAAGATATCTCATTTCTTCTTGTTTCTTCTTCGCC 

CACGACCACTTCACAAACACCGACCT^CAGCAAACAACy^CCCACCCC^ 

CCCTAGCACTTATGTACTTCTGAAAAGTCCCCAGAAATTCCAATCATCACACACTCAGAG 

AAACTGTCTGCTGCTGGCAAAACTACACCCCTGCTAGAGCATGAGGCAAATCATAGTCAG 

CTGCTGTGGACAGTCTGAAGCAGCCTGGCATCCCACACCTGAGATTAAAACAAAAACA^^ 

CTTACCTGTGTTTTGTTTTTGTTTTAAGAAACCAAAGTGCACCAAGATAGCATGCTCT^ 

AGATTGTGGCTGTCTAGAGATTTTTGGAACAGCAAGTTGAAGGAACTTTCTTACCTGCCT 

TGAATGGTGCTTTGAACTTCCTGCTGACCTGGAGTTTCTGTGTGAATATTTCTATCCAGT 

GTCCCCCTGTACCGGAAAGTACAAAGTCTGCTCTGGGCTTGCATGCCTGAACACTTTAAA 

ACACTGTGGAGCCAGGAATAATGGTACCCACCTGTAATCCCAGCACCTGGGAGACAGGAG 

GAACCAGGAGTTCAGGGTTATCCTGGGCTATATACCGTGACCCTGTCTACCCCCACACCC 

CAATAAAAAAACAAAAAGGTC 

SEQ ID NO: 104_SGK071_2_H 

GAGGTGGTGGCTGTGCAGATGATGGTGGAATGCATGGATGACCATTACGCCAGTCAGGCC 

CTGGAGGAGCTGATGCCACTGCTGAAGCTGCGGCACGCCCACATCTCTGTGTACCAGGAG 

CTGTTCATCACGTGGAATGGGGAGATCTCTTCTCTGTACCTCTGCCTGGTGATGGAGTTC 

AATGAGCTCAGCTTCCAGGAGGTCATTGAGGATAAGAGGAAGGCAAAGAAAATCATTGAC 

TCTGAGTGGATGCAGAATGTGCTGGGCCAGGTGCTGGACGCGCTGGAATACCTGCACCAT 

TTGGACATCTVTCCACAGGAATCTCAAACCCTCCAACATCATCCTCATCAGCAGT^ 

TGCT^CTGCAGGACCTGAGTTCCAATGTGCTAATGACAGACAAAGCCTU^ 

CGTGCGGAGGAAGACCCCTTTCGTAAGTCCTGGATGGCCCCTGAAGCCCTCAACTTCTCC 

TTCAGCCAGAAATCAGACATCTGGTCCCTGGGCTGCATCATTCTGGACATGACCAGC^ 

TCCTTCATGGATGGCACAGAAGCCATGCATCTGCGGAAGTCCCTCCGCCAGAGCCCAGGC 

AGCCTGAAGGCCGTCCTGAAGACAATGGAGGAGAAGCAGATCCCGGATGTGGAAACCTTC 

AGGAATCTTCTGCCCTTGATGCTCCAGATCGACCCCTCGGATCGAATAACGATAAAGGAC 

GTGGTGCACATCACCTTCTTGAGAGGCTCCTTCAAGTCCTCGTGCGTCTCTCTGACCCTG 

CACCGGCAGATGGTGCCTGCGTCCATCACCGACATGCTGTTAGAAGGCAACGTGGCCAGC 

ATTTTAGGTGATGCTGGGGACACAAAGGGGGAGCGTGCCCTGAAGCTCCTGTCCATG^ 

TTGGCATCCTATTGTTTAGTTCCAGAGGGTTCATTATTTATGCCCCTGGCCTTGCTCCAC 

ATGCACGACCAGTGGCTCAGCTGTGACCAGGACAGAGTCCCTGGGAAGAGAGACTTTGCC 

TCCCTGGGGAAACTAGGGAAGCTGTTGGGCCCCATCCCAAAGGGTCTGCCGTGGCCCCCG 

GAGCTGGTGGAGGTGGTGGTCACGACCATGGAGCTACATGACAGGGTCCTCGATGTCCAG 

CTGTGTGCCTGCTCCCTGCTGCTGCACCTCCTGGGCCAAGCGCTGGTGCACCACCCGGAA 

GCCAAGGCTCCCTGCAACCAAGCCATCACCTCCACCCTGCTGAGTGCTCTTCAGAGCCAC 

CCCGAGGAGGAGCCACTTCTTGTCATGGTCTACAGCCTGCTAGCCATCACCACAACCCAG 
GAGTCAGAGTCACTGTCAGAGGAGCTGCAGAACGCTGGGCTGCTGGAGCACATCCTGGAG
CACCTCAACAGCTCCCTCGAAAGCAGGGACGTCTGCGCCAGCGGCCTGGGCCTGCTCTGG 
GCCCTCCTGCTGGACGACCCCATCTTGGCACTCCAGCGCCCCAGGAAAAAGAGAGCTCCA 
AACCACGGAAAGCCCGGGAAACCCAAGAACCCTGCCAGCACCCAAAGTATCATTGTGAAC 
AAGGCCCCCTTGGAGAAGGTCCCGGACCTCATCAGCCAGGTGTTGGCCACCTACCCTGCG 
GATGGGGAAATGGCAGAAGCCAGCTGCGGAGTCTTCTGGCTGCTGTCCCTGCTGGGCTGC 
ATCAAGGAGCAGCAGTTTGAACAAGTGGTGGCGCTGCTCCTGCAAAGCATCCGGCTGTGC 
CAGGACAGAGCCCTGCTGGTGAACAATGCCTACCGGGGACTGGCCAGCCTGGTGAAGGTG 
TCAGAGCTGGCGGCCTTCAAGGTGGTGGTGCAGGAGGAGGGCGGCAGTGGCCTCAGCCTC 
ATCAAGGAGACCTACCAGCTCCACAGGGACGACCCGGAGGTGGTGGAGAACGTGGGCATG 
CTGCTGGTCCACCTGGCTTCCTATGAGGAGATCCTGCCGGAGCTGGTGTCCAGTAGTATG 
AAGGCCCTGCTCCAGGAGATCAAGGAGCGCTTCACCTCCAGCCTGGTGAGTGACAGCAGC 
GCCTTCAGCAAACCAGGCCTCCCTCCAGGTGGAAGCCCCCAGCTGGGGTGCACCACGTCT 
GGGGGACTGGAATAG 

SEQ ID NO: 105_AA118352_M SGK071_M 

CAGAAGAAGACCCCTGCCAGAAGTCCTGGATGGCTCCTGAAGCTCTCAAATTCTCCTTCT 

CCACCAAATCCGACATCTGGTCTCTGGGCTGCATCATTCTAGACATGGCCACTTGCTCCT 

TCCTGAACGACACAGAAGCCATGCAACTGCGGAAGGCCATCCGCCATCATC^ 

TGAAGCCCATCCTGAAAACCATGGAGGAGAAGCAAATCCCTGGTACAGATGTCTACTATT 

TGCTTCTGCCCTTCATGTTGCATATCAACCCCTCCGATCGACTGGCAATCAAGGATGTGA 

TGCAAGTCACCTTCATGAGCAACTCCTTCAAAAGCTCCTCTGTTGCGCTGAATATGCAGC 

GGCAGAAGGTCCCCATCTTCATCACTGACGTGCTGCTTGAAGGCAACATGGCCAACATCT 

TAGGCAGCTGGCTGTGTGCTTCCTTTGTGAACGACAGCAGGCACTGTGACTCAGGGATTG 

GCTCGCAGAGACTTGGGTTTGATTTTCAGTCAGTCTCTTGGACAGAGCACCCTCTGAAAG 

ATGTCATGCAGAATTTCTCCAGTCGACCAGAGGTCCAGCTCAGAGCCATTAACAAGTTGT 

TGACAATGCCAGAGGACCAGCTAGGGCTGCCATGGCCCACAGAGCTGCTGGAAGAGGTGA 

TCAGCATCATAAAGCAGCATGGGCGGATCCTGGATATTCrrcCTCAGCACCTGCTCCCTTC 

TGCTGCGTGTTCTTGGCa^GCACTGGCAAAGGACCCAGAAGCTGAGATCCCAAGC^ 

GTTTGATCATCTCCTTCCTGATGGATACCTTGCGGAGCCATCCTAACTCTGAAAGGCTTG 

TTAATGTGGTCTACAACGTGCTTGCCATTATTTCCAGCCAAGGACAGATCTCAGAAGAGC 

TGGAAGAGGAGGGGTTGTTTCAGCTTGCCCAAGAGAACCTGGAGCACTTCCAAGAGGACA 

GGGACATCTGCCTCTCTATCCTGAGCCTGCTCTGGTCCCTCCTGGTAGATGTTGTCACTG 

TGGACAAAGAGCCCTTGGAGCAGCTCTCTGGCATGGTCACCTGGGTGCTGGCTACTCATC 

CGGAGGACGTGGAAATAGCAGAGGCTGGCTGTGCGGTGCTCTGGCTGCTGTCCTTGTTGG 

GCTGCATAAAGGAGAGTCAGTTTGAGCAGGTGGTAGTGCTGCTCCTGAGAAGCATCCAGC 

TGTGCCCTGGCAGAGTACTGCTGGTGAACAATGCATTCCGTGGCTTGGCCAGCCTCGCAA 

AGGTGTCCGAACTGGTGGCCTTCCGAATAGTAGTACTGGAAGAGGGCAGCAGCGGCCTCC 

ACCTCATCCAAGATATCTACAAGCTCTACAAGGATGACCCTGAGGTGGTGGAGAACCTCT 

GCATGCTGTTGGCCCATCTGACCTCCTACAAGGAGATCCTGCCAGAGATGGAGTCTGGAG 

GCATCAAAGACCTAGTCCAGGTGATCCGGGGGCGCTTTACCTCCAGCCTGGAGCTGATTT 

CTTACGCTGATGAGATACTCCAGGTACTGGAAGCAAATGCACAACCTGGCCTCCAGGAGG 

ATCAGCTTGAGCCTCCTGCAGGGCAGGAAGCCCCACTGCAGGGAGAGCCCCTCTTCAGGC 

CCTGACATGCTGCCCTTCTGGTCCTGTGGTAAGAGAAAGTATCACTAGGTCCAGTATTAA 

TTTCGTACCCCATGGTGACTAATAAAAGAAGCCCTAGGCTGTTTCTGGC 

SEQ ID NO: 106_018653.9_H

GGCCGGGGTCGGGGCGCGGGGCATGCGCGCGGGCTGGGCAGGGGGCCGGCGGGGCGCAGA 
GCGGAGCCGCCTCGGAGCCTGAGCCGCCCGGGGCCGGGGCCGGGGAGCCGCGCGGGGCCG 
GCCGGCCGGGGGGAGGGGAGCGATGCGGCGCCGGCGGGCGGCAGTGGCCGCGGGTTTCTG 
FIGURE 2CCCC

CGCCTCCTTCCTGCTGGGCTCCGTCCTCAACGTGCTCTTCGCTCCGGGTCGGAGCCTCCG 

AGGCCAGGCCAGTCCCCTGAGCCTTCGCCGGCCCCGGGTGCGGGCCGTCGCGGGGGCCGC 

GGGGAGCTGGCCCGGCAGATCCGGGCGCGCTACGAGGAGGTGCAGCGCTATTCCCGCGGG 

GGGCCCGGGCCCGGGGCGGGCCGGCCGGAGCGGCGGCGCCTGATGGACCTGGCTCCGGGC 

GGGCCCGGCCTGCCGCGCCCCCGGCCCCCTTGGGCCCGGCCCCTGTCCGACGGCGCCCCA 

GGCTGGCCCCCGGCTCCCGGCCCAGGCTCCCCCGGCCCGGGCCCGCGCCTGGGCTGCGCC 

GCGCTTCGCAACGTGTCCGGCGCGCAGTACATGGGCTCAGGCTACACCAAGGCCGTGTAC 

CGGGTCCGCCTGCCCGGCGGTGCCGCGGTGGCGCTCAAGGCGGTGGACTTTAGCGGCCAC 

GATCTGGGCAGCTGCGTGCGCGAGTTCGGGGTACGGAGGGGCTGCTATCGGCTGGCGGCC 

CACAAGCTGCTTAAGGAGATGGTGCTGCTGGAGCGGCTGCGGCACCCCAACGTGCTGCAG 

CTCTATGGCTACTGCTACCAGGACAGCGAGGACATCCCAGACACCCTGACCACCA^ 

GAGCTGGGCGCCCCTGTAGAAATGATCCAGCTGCTGCAAACTTCCTGGGAGGATCGATTC 

CGAATCTGCCTGAGCCTGGGCCGCCTCCTCCACCACCTGGCCCACTCCCCACTGGGCTCC 

GtCT^CTCTGCTGGACTTCCGCCCTCGGCAGTTTGTGCTGGTGGATGGGGAGCTCAJ^ 

ACGGACCTGGATGACGCACGTGTGGAGGAGACGCCGTGTGCAGGCAGCACCGACTGCATA 

CTCGAGTTTCCGGCCAGGAACTTCACCCTGCCCTGCTCAGCCCAGGGCrGGTGCGAGG^ 

ATGAACGAGAAGCGGAACCTCTATAATGCCTACAGGTTTTTCTTCACATACCTCCTGCCT 

CACAGTGCCCCGCCTTCTVCTGCGTCCTCTGCTGGACAGCATCGTC^CGCCACAGGAGAG 

CTCGCCTGGGGGGTGGACGAGACCCTGGCCCAGCTGGAGAAGGTGCTGCACCTGTACCGG 

AGCGGGCAGTATCTGCAGAACTCCACGGCAAGCAGCAGTACCGAGTACCAGTGTATCCCA 

GAC7VGCACCATCCCCCAGGAAGACTACCGCTGCTGGCCATCCTACCACCACGGGAGCTGC 

CTCCTTTCAGTGTTCAACCTGGCTGAGGCTGTGGATGTCTGTGAGAGCCATGCCCAGTGT 

CGGGCCTTTGTGGTCTICCTUVCCAGACCT^CCTGGACAGGTCGGCAGCTGGTCTTI^ 

ACTGGATGGAGCCAAGTGGTCCCTGATCCCAACAAGACCACATATGTGAAGGCCTCTGGC 

TGACCTATCTGAGGGCTCGGCTGACCAGCTGACTATCCTCAGCAGCTGGGCTTGCCTGTG 

GAGGGAGTGACTTGCACTGGCAGOVCTGCATGTCACCTGGGAACCCCTGCAGACAA^ 

AACATCCCAGACAGACAGATGTGACCAGGACAAACGTGCAATAATGCCAAATGTTAAAAT 

GTGAGTTTACCAGCCTAGCTATGGGACTGCTGGCTCCTAGTCCAGGAATCATGGGGGTAT 

GACTGCCTCTCCAACCCTGTGGGCTGTAAGCAAGCTCAGGCTAGTCTCCCCACTGGGGGC 

TGTGCCCCTCCCTGGGACGGTTCCGTGGGCAGCCCCATCACTGTGTTCAATAGTGTGAGA 

ATGTAGCTAAAGCCCCTGCTGCTGCTGCTGCACATGCCACAGCAGGCGGTGGGGGCTGCG 

TGGGGACAATCCATCGTGGAGTGTTCTCTCAGCTTAGGTCTGGACTIGGAGACTTGGCC^ 

AGATGCTCCAGGATGTGGGTGATTCTGTACCTGGGGAGGCTATCTCTGACCTCCCGACAG 

GGGACACTCCCAGGCCAGCCCAGGGGTCAGGGGCT^GAGGTGCACACOT 

AGACTGGGGTCAGGGAGCAGGTGTGGTTTGAGCCAGGACCTGGGGCGGGGGTGGGGCCGG 

GGCCTTTCTGCCTCATTTGCTTTCAATGAAAGCCTCAAAGCAGCCAAAACCAGGCTTTCC 

CCCTTCCTCGAGTTTGAATATCCAGAATCTTTTGTACTTCTTGTTGGTTAAATTGTTTAT 

TTTTGTAAAAAATAAAATAAAATTAGTTAATAAAATGATGTTTC7VCAGCAAACTC 

T 

SEQ ID NO: 107_AA396601_M 

CCACGCGTCCGGGCTGCGCCGCGCTCCGCAACGTGTCTGGCGCGCAGTACGTGGGCTCAG 

GCTACACTAAGGCTGTGTACCGGGTCCGCCTGCCCGGCGGCGCCGCGGTGGCGCTTAAAG 

CAGTGGACTTCAGCGGCCACGATCTGGGCAGCTGCGTGCGCGAGTTCGGGGCGCGAAGGG 

GCTGCTATCGCCTGGCGGCCCACAAGCTGCTCAAAGAGATGGTGCTGCTGGAGCGGCTGC 

GGCACCCCTUVCGTGCTGCAGCTCTATGGCTATTGCTACCyVGGACAGTGAGGGCATCCCA 

ACACGCTGACCACCATCACAGAGCTGGGTGCCCCTGTGGAGATGATCCAGCTGTTGCAGA 

CTTCCTGGGAGGATCGATTCCGAATCTGCCTCAGCCTTGGCCGCCTCCTCCACCACCTGG 

CCCACTCCCCGCTGGGCTCGGTCACCCTGCTTGACTTCCGCCCTCGGCAGTTTGTGCTAG 

TGAACGGGGAGCTGAAAGTGACAGACCTGGATGATGCCCGCGTGGAAGAGACACCGTGCA 
FIGURE 2DDDD

CCAGCAGTGCCGACTGCACGCTAGAGTTTCCAGCCAGGAACTTCAGCCTGCCCTGCTCGG 
CCCAGGGCTGGTGCGAGGGCATGAATGAGAAACGGAACCTCTACAATGCCTACAGGTTCT 
TCTTCACATACCTCCTGCCACACAGTGCCCCGCCTTCCCTCCGACCTCTCCTGGATAGCA 
TCGTCAATGCCACGGGAGAGCTCGCCTGGGGGGTGGATGAGACCCTGGCCCAGCTGGAGA 
CAGCGCTACACTTGTTCCGAAGTGGGCAGTACCTGCTVGAACTCTACiVVGCAGCAGGGCTG 
AGTACCAGCGCATCCCGGACAGTGCCATCACACAGGAGGACTATCGCTGCTGGCCATCCT 
ATCACCACGGCGGCTGCCTCCTGTCCGTGTTCAACCTGGCTGAGGCTATAGATGTCTGTG 
AGAGCCATGCTCAGTGTCGTGCCTTTGTGGTCACCAACCAGACCACCTGGACAGGTCGGA 
AGCTGGTCTTTTTTAAGACTGGATGGAACCAAGTGGTCCCTGATGCCGGCAAGACCACAT 
ATGTGAAGGCCCCTGGTTGACTGGTTGTGGGCTCAGCTGACCAGCTGGGCTTGCCTGCTG 
CAGGCGTGACTTGCATCCCACCTGGGAACCCCTGCAGACAAAAGCTAGCTCCCAGAGCAA 
CTGATGTGACCAGGACAAAACGTGCAATATGCAAAAATGTTAAAATGTGAGTTTGCCAGC 
TTCAGTCCCAGACTGGTTGGAACCGGATTGCCTCTCTGGAGCTGTAGGCTGTGAGCAGGG 
CTCAGGCTGGTCTTAACTGGGACAGTCCCGTGGGCAGCCCATTACTGCATTCATGCTTTG 
AGAATGTAGCCAGAACACTGCTGCTGCATAAGCCACCGTGGGCAGGAGCTGCCTGGGGAC 
AACCAGTCTCAGAGTGCTCTCTCAGCTCAGCTCCGCTCCAAATGGAGAGCGCGGGATGCG 
GAGATGTGAGTGAACCAGCACTGGGAAGAAGGCTCTCGGGCCTCTCCCTAGAGGTTGCTC 
CTAGGCCAGCCCCGAGGCCGTGGGCAGCAGTGCTCGCATCCATATGAGCCAAGACTAGAG 
TGGAGGAGCAGATTGCATTTGAGCCAGGACTGGGGTGGGGGTAGGGTCGGGGCCTCTCTG 
CCTCATTTGCTTTCT^GTGAAAGCCAGGGAGCAGCCGCAGCCAGGCTCCTCCCACTCCTGG 
AGGCCAGGCTCCTCCCCCTCCTGGAGGCCAGGCTCCTCCCCCTCCTGGAGTTTGCGTACC 
CAGAAGCTTTTATACTTCTCGTTCATTAAATTGTTTATTTTTGTAAAAAAAAAAATTAAT 
CAATTAATAAAATGATGTTTTGTGAC 

SEQ ID NO: 108_VRK3_H 

ATGATCTCCTTCTGTCCAGACTGTGGCAAAAGTATCCAAGCGGCATTCAAATTCTGCCCC 

TACTGTGGAAATTCTTTGCCTGTAGAGGAGCATGTAGGGTCCCAGACCTTTGTCAATCCA 

CATGTGTCATCCTTCaU^GGCTCAAAGAGAGGGCTGAACTCCAGTTTTGAAACCTCTCCT 

AAGAAAGTGAAATGGTCCAGCACCGTCACCTCTCCCCGATTATCCCTCTTCTCAGATGGT 

GAGAGTTCTGAGTCTGAAGATACTCTGAGTTCCTCTGAGAGATCCAAAGGCTCCGGGAGC 

AGACCCCCAACCCCCAAAAGCAGCCCTCAGAAGACCAGGAAGAGCCCTCAGGTGACCAGG 

GGTAGCCCTCAGAAGACCAGCTGTAGCCCTCAGAAGACCAGGCAGAGCCCTCAGACGCTG 

AAGCGGAGCCGAGTGACCACCTCACTTGAAGCTTTGCCCACAGGGACAGTGCTGACAGAC 

AAGAGTGGGCGACAGTGGAAGCTGAAGTCCTTCCAGACCAGGGACAACCAGGGCATTCTC 

TATGAAGCTGCACCCACCTCCACCCTCACCTGTGACTCAGGACCACAGAAGCAAAAGTTC 

TCACTCAAACTGGATGCCAAGGATGGGCGCTTGTTCT^TGAGCAGAACTTCTTCCAG 

GCCGCCAAGCCTCTGCAAGTCAACAAGTGGAAGAAGCTGTACTCGACCCCACTGCTGGCC 

ATCCCTACCTGCATGGGTTTCGGTGTTCACCAGGACAAATACAGGTTCTTGGTGTTACCC 

AGCCTGGGGAGGAGCCTTCAGTCGGCCCTGGATGTCAGCCCAAAGCATGTGCTGTCAGAG 

AGGTCTGTGCTGCAGGTGGCCTGCCGGCTGCTGGATGCCCTGGAGTTCCTCCATGAGAAT 

GAGTATGTTCATGGAAATGTGACAGCTGAAAATATCTTTGTGGATCCAGAGGACCAGAGT 

CAGGTGACTTTGGCAGGCTATGGCTTCGCCTTCCGCTATTGCCCAAGTGGCAAACACGTG 

GCCTACGTGGAAGGCAGCAGGAGCCCTCACGAGGGGGACCTTGAGTTCATTAGCATGGAC 

CTGCACAAGGGATGCGGGCCCTCCCGCCGCAGCGACCTCCAGAGCCTGGGCTACTGCATG 

CTGAAGTGGCTCTACGGGTTTCTGCCATGGACAAATTGCCTTCCCAACACTGAGGACATC 

ATGAAGCAAAAACAGAAGTTTGTTGATAAGCCGGGGCCCTTCGTGGGACCCTGCGGTCAC 

TGGATCAGGCCCTCAGAGACCCTGCAGAAGTACCTGAAGGTGGTGATGGCCCTCACGTAT 

GAGGAGAAGCCGCCCTACGCCATGCTGAGGAACAACCTAGAAGCTTTGCTGCAGGATCTG 

CGTGTGTCTCCATATGACCCCATTGGCCTCCCGATGGTGCCCTAG 



FIGURE 2EEEE
SEQ ID NO: 109_S71575_M VRK3_M 

CCATCCCCACCTGTATCGGCTTTGGCATTCACCAGGACAAGTACAGGTTCCTAGTATTCC 
CCAGCCTGGGGAGGAGCCTTCAGTCAGCCCTGGATGACAACCCAAAGCATGTGGTATCAG 
AGAGATGTGTGCTTCAGGTGGCCTGCAGGCTGCTGGATGCTCTGGAGTATCTCCATGAAA 
ATGAGTATGTTCACGGGAACCTGACAGCTGAGAATGTCTTTGTGAATCCAGAGGATCTGA 
GCCAGGTGACCCTGGTGGGCTATGGCTTCACCTACCGATACTGCCCAGGTGGCAAACACG 

tggcctacaaagaaggcagcaggagtccacacgatggggacttggagttcattagcatgg 

acctgcacaagggatgcggaccctcccgccgcagcgatctccagaccttgggctactgta 

tgctcaagtggctttatgggtccctgccatggacaaattgccttcccaacaccgaa;^ 

taactaggcagaagcagaagtatctggacagccccgagcgcctcgtgggactgtgtggcc 

gctggaacaaggcctcagagaccctgcgggagtacctgaaggtggtgatggccctcaatt 

atgaggagaagccaccctatgccacgctgaggaacagcctagaagctctgctgcaggata 

tgcgggtgtcaccctatgaccctctggacctccagatggtgccttagatggaatccagag 

cttccgacttgcagcttgaagtagaacatgaagtagtgtgactggaggcctgtttgaact 

catagctcctaaaagaatcccttgaatgtgcattctcaccgctcccttaggacatatgaa 

tcagcacttgtgttggggaacctgagtcatgtcatgtaatgtgaaactcctccctgtctc 

agctctggcagctgtggatggaggtaagtggatgctggcggcggcggcggcagcagccac 

TCCACTCCCTATGGCATTTCTGTGATGGCATAATAAACTGTTTTTAATC 
SEQ ID NO: 110__AA4 5427_H 

ATGGGCCACGCGCTGTGTGTCTGCTCTCGGGGAACTGTCATCATTGACAATAAGCGCTAC 
CTCTTCATCCAGAAACTGGGGGAGGGTGGGTTCAGCTATGTGGACCTAGTGGAAGGGTTA 
CATGATGGACACTTCTACGCCCTGAAGCGAATCCTGTGTCACGAGCAGCT^GGACCGGGAG 
GAGGCCCAGCGAGAAGCCGACATGCATCGCCTCTTCAATCACCCCAACATCCTTCGCCTC 
GTGGCTTACT6TCTGAGGGAACGGGGTGCTAAGCATGAGGCCTGGCTGCTGCTACCATTC 
TTCAAGAGAGGTACGCTGTGGAATGAGATAGAAAGGCTGAAGGACAAAGGCT^CTTCCTG 
ACCGAGGATCAAATCCTTTGGCTGCTGCTGGGGATCTGCAGAGGCCTIGAGGCCATTCAT 
GCCAAGGGTTATGCCCACAGAGACTTGAAGCCCACCAATATATTGCTTGGAGATGAGGGG 
CAGCCAGTTTTAATGGACTTGGGTTCCATGAATCAAGCATGCATCCATGTGGAGGGCTCG 
CGCCAGGCTCTGACCCTGCAGGACTGGGCAGCCCAGCGGTGCACCATCTCCTACCGAGCC 
CCAGAGCTCTTCTCTGTGCAGAGTCACTGTGTCATCGATGAGCGGACTGATGTCTGGTCC 
CTAGGCTGCGTGCTATATGCCATGATGTTTGGGGAAGGCCCTTATGACATGGTGTTCCAA 
AAGGGTGACAGTGTGGCCCTTGCTGTGCAGAACCAACTCAGCATCCCACAAAGCCCCAGG 
CATTCTTCAGCTITTGCGGCAGCTCCTGAACTCGATGATGACCGTGGACCCGCATCAGCGT 
CCTCACATTCCTCTCCTCCTCAGTCAGCTGGAGGCGCTGCAGCCCCCAGCTCCTGGCCAA 
CATACTACCCAAATCTGA 

SEQ ID NO: 111_H05721_H 

CCCTGAGGCACCGCCCCAAGTTTGGTGTGACCGGCGGGGGACGCCGGTGGTGGCGGCAGC 
GACGGCTGCGGGGGCACCGGGCCGCGGCGCCACCATGGCGGTGCGACAGGCGCTGGGCCG 
CGGCCTGCAGCTGGGTCGAGCGCTGCTGCTGCGCTTCACGGGCAAGCCCGGCCGGGCCTA 
CGGCTTGGGGCGGCCGGGCCCGGCGGCGGGCTGTGTCCGCGGGGAGCGTCCAGGCTGGGC 
CGCAGGACCGGGCGCGGAGCCTCGCAGGGTCGGGCTCGGGCTCCCTAACCGTCTCCGCTT 
CTTCCGCCAGTCGGTGGCCGGGCTGGCGGCGCGGTTGCAGCGGCAGTTCGTGGTGCGGGC 
CTGGGGCTGCGC6GGCCCTTGCGGCCGGGCAGTCTTTCTGGCCTTCGGGCTAGGGCTGGG 
CCTCATCGAGGAAAAACAGGCGGAGAGCCGGCGGGCGGTCTCGGCCTGTCAGGAGATCCA 
GGCAATTTTTACCCAGAAAAGCAAGCCGGGGCCTGACCCGTTGGACACGAGACGCTTGCA 
GGGCTTTCGGCTGGAGGAGTATCTGATAGGGCAGTCCATTGGTAAGGGCTGCAGTGCTGC 
TGTGTATGAAGCCACCATGCCTACATTGCCCCAGAACCTGGAGGTGAC/iAAGAGCACCGG 
GTTGCTTCCAGGGAGAGGCCCAGGTACCAGTGCACCAGGAGAAGGGCAGGAGCGAGCTCC 
FIGURE 2FFFF

GGGGGCCCCTGCCTTCCCCTTGGCCATCAAGATGATGTGGAACATCTCGGCAGGTTCCTC 

CAGCGAAGCCATCTTGAACACAATGAGCCAGGAGCTGGTCCCAGCGAGCCGAGTGGCCTT 

GGCTGGGGAGTATGGAGCAGTCACTTACAGAAAATCCAAGAGAGGTCCCAAGCAACTAGC 

CCCTCACCCCAACATCATCCGGGTTCTCCGCGCCTTCACCTCTTCCGTGCCGCTGCTGCC 

AGGGGCCCTGGTCGACTACCCTGATGTGCTGCCCTCACGCCTCCACCCTGAAGGCCTGGG 

CCATGGCCGGACGCTGTTCCTCGTTATGAAGAACTATCCCTGTACCCTGCGCCAGTACCT 

TTGTGTGAACACACCCAGCCCCCGCCTCGCCGCCATGATGCTGCTGCAGCTGCTGGAAGG 

CGTGGACCATCTGGTTCAACAGGGCATCGCGCACAGAGACCTGAAATCCGACAACATCCT 

TGTGGAGCTGGACCCAGACGGCTGCCCCTGGCTGGTGATCGCAGATTTTGGCTGCTGCCT 

GGCTGATGAGAGCATCGGCCTGCAGTTGCCCTTCAGCAGCTGGTACGTGGATCGGGGCGG 

AAACGGCTGTCTGATGGCCCCAGAGGTGTCCACGGCCCGTCCTGGCCCCAGGGCAGTGAT 

TGACTACAGCAAGGCTGATGCCTGGGCAGTGGGAGCCATCGCCTATGAAATCTTCGGGCT 

TGTCAATCCCTTCTACGGCCAGGGCAAGGCCCACCTTGAAAGCCGCAGCTACCT^GAGGC 

TCAGCTACCTGCACTGCCCGAGTCAGTGCCTCCAGACGTGAGACAGTTGGTGAGGGCACT 

GCTCCy^GCGAGAGGCCAGCAAGAGACCATCTGCCCGAGTAGCCGCAAATGTGCTTCATCT 

AAGCCTCTGGGGTGAACATATTCTAGCCCTGAAGAATCTGAAGTTAGACAAGATGGTTGG 

CTGGCTCCTCCAACTU^TCGGCCGCCACTTTGTTGGCCTyvCAGGCTCACA 

TGTGGAAACAAAAATGAAGATGCTCTTTCTGGCTAACCTGGAGTGTGAAACGCTCTGCCA 

GGCAGCCCTCCTCCTCTGCTCATGGAGGGCAGCCCTGTGATGTCCCTGCATGGAGCTGGT 

GAATTACTAAAAGAACATGGCATCCTCTGTGTCGTGATGGTCTGTGAATGGTGAGGGTGG 

GAGTCAGGAGACAAGACAGC6CAGAGAGGGCTGGTTAGCCGGAAAAGGCCTCGGGCTTGG 

CAAATGGAAGAACTTGAGTGAGAGTTCAGTCTGCAGTCCTCTGCTCACAGACATCTGAAA 

AGTGAATGGCCAAGCTGGTCTAGTAGATGAGGCTGGACTGAGGAGGGGTAGGCCTGCATC 

CACAGAGAGGATCCAGGCCAAGGCACTGGCTGTCAGTGGCAGAGTTTGGCTGTGACCTT 

GCCCCTAACACGAGGAACTCGTTTGAAGGGGGCAGCGTAGCATGTCTGATTTGCCACCTG 

GATGAAGGCAGACTVTCAACATGGGTCAGCTVCGTTCAGTTACGGGAGTGGGAAAT^ 

AGGCCTGGGCCTCTGCGTTCCCAAGCTGTGCGTTCTGGACCAGCTACTGAATTATTAATC 

TCACTTAGCGAAAGTGACGGATGAGCAGTAAGTAAGTAAGTGTGGGGATTTAAACTTGAG 

GGTTTCCCTCCTGACTAGCCTCTCTTACAGGAATTGTGAAATATTAAATGCAAATTTACA 

ACTGCAGATGACGTATGTGCCTTGAACTGAATATTTGGCTTTAAGAATGATTCTTCTTAT 

ACTCTGAAGGTGAGAATATTTTGTGGGCAGGTATCAACATTGGGGAAGAGATTTCATGTC 

TAACTAACTAACTTTATACATGATTTTTAGGAAGCTATTGCCTAAATCAGCGTCAACATG 

CAGTAAAGGTTGTCTTCAACTGACAAAA 

SEQ ID NO: 112_AI086865_H

AATGAGATGGAGAAGTACGAGCGGATCCGAGTGGTGGGGAGAGGTGCCTTCGGGATTGTG 
CACCTGTGCCTGCGAAAGGCTGACCAGAAGCTGGTGATCATCAAGCAGATTCCAGTGGAA 
CAGATGACCAAGGAAGAGCGGCAGGCAGCCCAGAATGAGTGCCAGGTCCTCAAGCTGCTC 
AACCACCCCAATGTC7^TTGAGTACTACGAGAACTTCCTGGAAGAC7\AAGCCCTTATGATC 
GCCATGGAATATGCACCAGGCGGCACTCTGGCTGAGTTCATCCAAAAGCGCTGTAATTCC 
CTGCTGGAGGAGGAGACCATCCTGCACTTCTTCGTGCAGATCCTGCTTGCACTGCATCAT 
GTGCACACCCACCTCATCCTGCACCGAGACCTC7VAGACCCAGAACATCCTGCTTGACAAA 
CACCGCATGGTCGTCAAGATCGGTGATTTCGGCATCTCCAAGATCCTTAGCAGCAAGAGC 
ACCCCATGCTATATCTCCCCTGAGCTGTGTGAGGGCAAGCCCTACAACCAGAAGAGTGAC 
ATCTGGGCCCTGGGCTGTGTCCTCTACGAGCTGGCCAGCCfCAAGAGGGCTTTCGAGGCT 
GCGAACTTGCCAGCACTGGTGCTGAAGATCATGAGTGGCACCTTTGCACCTATCTCTGAC 
CGGTACAGCCCTGAGCTTCGCCAGCTGGTCCTGAGTCTACTCAGCCTGGAGCCTGCCCAG 
CGGCCACCACTCAGCCACATCATGGCACAGCCCCTCTGCATCCGTGCCCTCCTCAACCTC 
CACACCGACGGCAGAGAAGTCCGTGGCCCCCAGCAACACAGGGAGCAGGACCACCAGTGT 
CCGCTGCAGAGAGGCATCATCATGACTVTTCGGCAGCGGCAGCT^TGGGTGCCTAGGCCAT 
FIGURE 2GGGG



GGCAGCCTCT^CTGACATCAGCCAGCCCACCATTGTGGAGGCTTTGTTGGGCTATGAAATG 

GTGCAGCAAGTGGAGGAGGCCCTGAGCTTCACACTACTAGGCTCTGCACCCCTGGACCAG 

GAGCCTCTGCTGAGTATAGACCTGGGCACTGCTCACTCAGCTGCTGTGACTGGTGAGGAG 

GACTTGGGCTCTGGAGATGTAAACAGGTTACCCAGCTGGGAGAGAGGACATCTGCTGGCT 

GGTGTGGCGTCCAGCACTGATGTGTCTACCTTCTCTGAAGGTGACTGCAAGGAGCCTGAC 

AAGTGCTGCTGGAGACACAAGCAGTGCACTGGGCACATCATCTACCCTTTCGCCTCTGAC 

TGTGTCCGCCACAGCCTGCACCTACACTCTGTCAACCACTGCAACTGTAATTCTAGGCTG 

AAGGACTCTTCAGAGGATAGCAGCAGCTCCCGGGGCGCGGGCCCAACCTGCrCCCATGTC 

ATCGAGTCCCCTTGCTTTGAGCTCACACCGGAGGAGGAGCATGTGGAGCGATTCCGGTAT 

GGCTGGTGCAAAAGCTACAGACCTGTCTCTGTGGCAGTGATCCACCATCCACTCTACCAT 

GAGTGTGGGGCAGATGATCTAAATGXXAAGAAGAGGAAGAGGAGGAGGAGGAT^GCAAG 

CCCCCCATCCCGACACAGGTGGGGCCCGCCACCGCCTCCCCTGACCTAGGCACCAGCATG 

GCCACTGGTACCCCTGACTCCACAGCGCCCATCACCATCTGGCGCTCTGAGAGCCCCACA 

GGGAAGGGTCAGGGCAGCAAGGTGATC1?^GAAGGTAAAGAAGAAAAAGGAAAAAGAGAA^ 

GACAAGGAGGAGATGGATGAGAAGGCTVAAGCTGAAGAAAAAAGCCAAGAAAGGCCAGTT 

ACTAAGAAGAAAAGCCCGGTTAAATTGGAGCCTTCCCCGCCAGACGTGAGCCGATCATTA 

AGCGCAAGACAGCTGGCCAGGATGTCCGAGTCCAGCCCAGAAAGCCGGGAAGAGCTGGAG 

AGCGAGGACAGTTACAATGGCCGGGGGCAGGGAGAACTGTCCAGCGAGGATATTGTGGAA 

TCATCATCGCCCAGGAAGAGAGAGAACACAGTCCAGGCCAAAAAGACAGGGGCAAAGCCC 

TCACAAGCCAGGAAGGTAAACAAGAGAAAATCTCCCCCAGGATCAAACCCCAACCTCAGT 

TGAGGCCAGGGTGGTCAGGGTGCAGAATAAATGCCATCGAGCCTGTGGCTGGCCCTCTGC 

TGCTGTTCTCTCCCTCCAACCTGGCTGTTTCTTGCGGGGCAAGGGGTGGGCTCAGGGCTG 

CAGGGGTTTCTCAT^GGCAATCCAGCTTTCACT^GGAAGCCCATGGG^ 

AGGGAAAGGAAGGGGCACAGCCCTATTTCTTCCTACCTGCTAGGACAAGGTGGAAGAGTG 

TATCTGGGGTGGGAAGGAGGGCTTCCCCTCTCTGCTGCGAGAGACTGGTCTGTGTGAAAT 

CCACTTCTGGGACAGGCAGTACTGTCTGCAGCGATACCCCCAAT7VAACGGAACTTTTTAA 

CCC 

SEQ ID NO: 113_AA836348_H 

ATGTCGGTGCTGGGCGAGTACGAGCGACACTGCGATTCCATCAACTCGGACTTTGGGAGC 
GAGTCCGGGGGTTGCGGGGACTCGAGTCCGGGGCCTAGCGCCAGTCAGGGGCCGCGAGCC 
GGCGGCGGCGCGGCGGAGCAGGAGGAACTGCACTACATCCCCATCCGCGTCCTGGGCCGC 
GGCGCCTTCGGGGAAGCCACGCTGTACCGCCGCACCGAGGATGACTCACTGGTTGTGTGG 
AAGGAAGTCGATTTGACCCGGCTGTCTGAGAAGGAACGTCGTGATGCCTTGAATGAGATA 
GTTATTCTGGCACTGCTGCT^GCACGACAACATTATTGCCTACTACAATCACTTCATGGAC 
AATACCACGCTGCTGATTGAGCTGGAATATTGTAATGGAGGGAACCTGTATGACAAAATC 
CTTCGTCAGAAGGACAAGTTGTTTGAGGAAGAGATGGTGGTGTGGTACCTATTTCAGATT 
GTTTCAGCAGTGAGCTGCATCCATAAAGCTGGAATCCTTCATAGAGATATAAAGACATTA 
AATATTTTTCTGACCAAGGCAAACCTGATAAAACTTGGAGATTATGGCCTAGCAAAGAAA 
CTTAATTCTGAGTATTCCATGGCTGAGACGCTTGTGGGAACCCCATATTACATGTCTCCA 
GAGCTCTGTCAAGGAGTAAAGTACAATTTCAAGTCTGATATCTGGGCAGTTGGCTGCGTC 
ATTTTTGAACTGCTTACCTTAAAGAGGACGTTTGATGCTACAAACCCACTTAACCTGTGT 
GTGAAGATCGTGCAAGGAATTCGGGCCATGGAAGTTGACTCTAGCCAGTACTCTTTGGAA 
TTGATCCAAATGGTTCATTCGTGCCTTGACCAGGATCCTGAGCAGAGACCTACTGCAGAT 
GAACTTCTAGATCGCCCTCTTCTCAGGAAACGCAGGAGGTCAAGCACTGTGACTGAAGCA 
CCCATTGCTGTAGTAACATCACGAACCAGTGAAGTCTATGTTTGGGGTGGTGGAAAATCC 
ACCCCCCAGAAACTGGATGTTATCAAGAGTGGCTGTAGTGCCCGGCAGGTCTGTGCAGGG 
AATACCCACTTTGCTGTGGTCACAGTGGAGAAGGAACTGTACACTTGGGTGAACATGCAA 
GGAGGCACTAAACTCCATGGTCAGCTGGGCCATGGAGACAAAGCCTCCTATCGACAGCCA 
AAGCATGTGGAAAAGTTGCAAGGCAAAGCTATCCGTCAGGTGTCATGTGGTGATGATTTC 
FIGURE 2HHHH

ACTGTCTGTGTGACTGATGAGGGTCAGCTCTATGCCTTCGGATCAGATTATTATGGCTGC 

ATGGGGGTGGACAAAGTTGCTGGCCCTGAAGTGCTAGAACCCATGCAGCTGAACTTCTTC 

CTCAGCAATCCAGTGGAGCAGGTCTCCTGTGGAGATAATCATGTGGTGGTTCTGACACGA 

AACAAGGAAGTCTATTCTTGGGGCTGTGGCGAATATGGACGACTGGGTTTGGATTCAGAA 

GAGGATTATTATACACCACAAAAGGTGGATGTTCCCAAGGCCTTGATTATTGTTGCAGTT 

CT^TGTGGCTGTGATGGGACATTTCTGTTGACCCAGTCAGGCAAAGTGCTGGCCTGTGGA 

CTCAATGAATTCAATAAGCTGGGTCTGAATCAGTGCATGTCGGGAATTATCAACCATGAA 

GCATACCATGAAGTTCCCTACACAACGTCCTTTACCTTGGCCAAACAGTTGTCCTTTTAT

AAGATCCGTACCATTGCCCCAGGCAAGACTCACACAGCTGCTATTGATGAGCGAGGCCGG 

CTGCTGACCTTTGGCTGCTVACAAGTGTGGGCAGCTGGGCGTTGGGAACTACAAGAAG^ 

CTGGGAATCAACCTGTTGGGGGGACCCCTTGGTGGGAAGCAAGTGATCAGGGTCTCCTGC 

GGTGATGAGTTTACCATTGCTGCCACTGATGAGAAAGTATTGAATTCTAAGACCATCCGT 

TCCAATAGCAGTGGCTTATCCT^TTGGAACTGTGTTTCAGAGCTCTAGCCCGGGAGGAGGC 

GGCGGGGGCGGCGGTGGTGAAGAAGAGGACAGTCAGCAGGAATCTGAAACTCCTGACCCA 

AGTGGAGGCTTCCGAGGAACAATGGAAGCAGACCGAGGAATGGAAGGTTTAATCAGTCCC 

ACAGAGGCCATGGGGAACAGTAATGGGGCCAGCAGCTCCTGTCCTGGCTGGCTTCGAAAG 

GAGCTGGAAAATGCAGAATTTATCCCCATGCCTGACAGCCCATCTCCTCTCAGTGCAGCG 

TTTTCAGAATCTGAGAAAGATACCCTGCCCTATGAAGAGCTGCAAGGACTCAAAGTGGCC 

TCTGAAGCTCCTTTGGAACTICAAACCCCT^GTAGAAGCCTCGGTAACTGAGCTTTTTGCC 

TTTGAATCACAACTAGTCACCTCGGCTGAATCCTGCAGTAACCTGTGCTGGGAAGGGAAC 

ACCACTGACTCCTCCTGCGTGTGCGTGCAGCTCTCTGCAGGTGGAGGTTGA 

SEQ ID NO: 114_R866€8_H, MKK6_H 

ATGAACTTGCTGCTCTCCTACCGCGATGTGCAGGACTACTCGGCCATCATTGAGCTGGTG 

GAGACGCTGCAGGCCTTGCCCACCTGTGATGTGGCCGAGCAGCATAATGTCTGCTTCCAC 

TACACTTTTGCCCTCAACCGGAGGAACAGGCCTGGGGACCGGGCGAAGGCCCTGTCTGTG 

CTGCTGCCGCTGGTACAGCTTGAGGGCTCTGTGGCGCCCGATCTGTACTGCATGTGTGGC 

CGTATCTACJ^GGACATGTTCTTCAGCTCGGGTTTCCAGGATGCTGGGCACCGGGAGCAG 

GCCTATCACTGGTATCGCAAGGCTTTTGACGTAGAGCCCAGCCTTCACTCAGGCATCAAT 

GCAGCTGTGCTCCTCy^TTGCTGCCGGGCAGCACTTTGAGGATTCCAAAGAGCTCCG^ 

ATAGGCATGAAGCTGGGCTGCCTGCTGGCCCGCAAAGGCTGCGTGGAGAAGATGCAGTAT 

TACTGGGATGTGGGTTTCTACCTGGGAGCCCAGATCCTCGCCAATGACCCCACCCAGGTG 

GTGCTGGCTGCAGAGCAGCTGTATAAGCTCAATGCCCCCATATGGTACCTGGTGTCCGTG 

ATGGAGACCTTCCTGGTCTACCAGCACTTCAGGCCCACGCCAGAGCCCCCTGGAGGGCCA 

CCACGCCGTGCCCACTTCTGGCTCCACTTCTTGCTACAGTCCTGCCAACCATTCAAGACA 

GCCTGTGCCCAGGGCGACCAGTGCTTGGTGCTGGTCCTGGAGATGAACAAGGTGCTGCTG 

CCTGCAAAGCTCGAGGTTCGGGGTACTGACCCAGTAAGCACAGTGACCCTGAGCCTGCTG 

GAGCCTGAGACCCAGGACATTCCCTCCAGCTGGACCTTCCCAGTCGCCTCCATATGCGGA 

GTCAGCGCCTCAAAGCGCGACGAGCGCTGCTGCTTCCTCTATGCACTCCCCCCGGCTCAG 

GACGTCCAGCTGTGCTTCCCCAGCGTAGGGCACTGCCAGTGGTTCTGCGGCCTGATCCAG 

GCCTGGGTGACGAACCCGGATTCCACGGCGCCCGCGGAGGAGGCGGAGGGCGCGGGGGAG 

ATGTTGGAGTTTGATTATGAGTACACGGAGACGGGCGAGCGGCTGGTGCTGGGCAAGGGC 

ACGTATGGGGTGGTGTACGCGGGCCGCGATCGCCACACGAGGGTGCGCATCGCCATCAAG 

GAGATCCCGGAGCGGGACAGCAGGTTCTCTCAGCCCCTGCATGAAGAGATCGCTCTTCAC 

AGACGCCTGCGCCACAAGAACATAGTGCGCTATCTGGGCTCAGCTAGCCAGGGCGGCTAC 

CTTAAGATCTTCATGGAGGAAGTGCCTGGAGGCAGCCTGTCCTCCTTGCTGCGGTCGGTG 

TGGGGACCCCTGAAGGACAACGAGAGCACCATCAGTTTCTACACCCGCCAGATCCTGCAG 

GGACTTGGCTACTTGCACGACAACCACATCGTGCACAGGGACATAAAAGGGGACAATGTG 

CTGATCAACACCTTCAGTGGGCTGCTCAAGATTTCTGACTTCGGCACCTCCAAGCGGCTG 

GCAGGCATCACACCTTGOVCTGAGACCTTCTICAGGAACTCTGCAGTATATGGCCCC^ 
ATCATTGACCAGGGCCCACGCGGGTATGGGAAAGCAGCTGACATCTGGTCACTGGGCTGC
ACTGTCATTGAGATGGCCACAGGTCGCCCCCCCTTCCACGAGCTCGGGAGCCCTICAGGCT 
GCCATGTTTCAGGTGGGTATGTACAAGGTCCATCCGCCAATGCCCAGCTCTCTGTCGGCC 
GAGGCCCAAGCCTTTCTCCTCCGAACTTTTGAGCCAGACCCCCGCCTCCGAGCCAGCGCC 
CAGACACTGCTGGGGGACCCCTTCCTGCAGCCTGGGAAAAGGAGCCGCAGCCCCAGCTCC 
CCACGACATGCTCCACGGCCCTCAGATGCCCCTTCTGCCAGTCCCACTCCTTCAGCCAAC 
TCAACCACCCAGTCTCAGACATTCCCGTGCCCTCAGGCACCCTCTCAGCACCCACCCAGC 
CCCCCGAAGCGCTGCCTCAGTTATGGGGGCACCAGCCAGCTCCGGGTGCCCGAGGAGCCT 
GCGGCCGAGGAGCCTGCGTCTCCGGAGGAGAGTTCGGGGCTGAGCCTGCTGCACCAGGAG 
AGCAAGCGTCGGGCCATGCTGGCCGCAGTATTGGAGCAGGAGCTGCCAGCGCTGGCGGAG 
AATCTGCACCAGGAGCAGAAGCAAGAGCAGGGGGCCCGTCTGGGCAGAAACCATGTGG^ 
GAGCTGCTGCGCTGCCTCGGGGCACACATCCACACTCCCAACCGCCGGCAGCTCGCCCAG 
GAGCTGCGGGCGCTGCAAGGACGGCTGAGGGCCCAGGGCCTTGGGCCTGCGCTTCTGCAC 
AGACCGCTGTTTGCCTTCCCGGATGCGGTGAAGCAGATCCTCCGCAAGCGCCAGATCCGT 
CCACACTGGATGTTCGTTCTGGACTCACTGCTCAGCCGTGCTGTGCGGGCAGCCCTGGGT 
GTGCTAGGACCGGAGGTGGAGAAGGAGGCGGTCTCACCGAGGTCAGAGGAGCTGAGTAAT 
GAAGGGGACTCCCAGCAGAGCCCAGGCCAGCAGAGCCCGCTTCCGGTGGAGCCCGAGCAG 
GGCCCCGCTCCTCTGATGGTGCAGCTGAGCCTCTTGAGGGCAGAGACTGATCGGCTGCGC 
GAAATCCTGGCGGGGAAGGAACGGGAGTACCAGGCCCTGGTGCAGCGGGCTCTACAGCGG 
CTGAATGAGGAAGCCCGGACCTATGTCCTGGCCCCAGAGCCTCCAACTGCTCTTTCAACG 
GACCAGGGCCTGGTGCAGTGGCTACAGGAACTGAATGTGGATTCAGGCACCATCCAAATG 
CTGTTGAACCATAGCTTCACCCTCCACACTCTGCTCACCTATGCCACTCGAGATGACCTC 
ATCTACACCCGCATCAGGGGAGGGATGGTATGCCGCATCTGGAGGGCCATCTTGGCACAG 
CGAGCAGGATCCACACCAGTCACCTCTGGACCCTGA 

SEQ ID NO: 115_PAK6_H 

ATGTTTGGGAAGAAAAAGAAAAAGATTGAAATATCTGGCCCGTCCAACTTTGAACACAGG 

GTTCATACTGGGTTTGATCCACAAGAGCAGAAGTTTACCGGCCTTCCCCAGCAGTGGCAC 

AGCCTGTTAGCAGATACGGCCAACAGGCCAAAGCCTATGGTGGACCCTTCATGCATCACA 

CCCATCCAGCTGGCTCCTATGAAGACAATCGTTAGAGGAAACAAACCCTGCAAGGAAACC 

TCCATCAACGGCCTGCTAGAGGATTTTGACAACATCTCGGTGACTCGCTCCAACTCCCTA 

AGGAAAGAAAGCCCACCCACCCCAGATCAGGGAGCCTCCAGCCACGGTCCAGGCCACGCG 

GAAGAAAATGGCTTCATCACCTTCTCCCAGTATTCCAGCGAATCCGATACTACTGCTGAC

TACACGACCGAAAAGTACAGGGAGAAGAGTCTCTATGGAGATGATCTGGATCCGTATTAT 

AGAGGCAGCCACGCAGCCAAGCAAAATGGGCACGTAATGAAAATGAAGCACGGGGAGGCC 

TACTATTCTGAGGTGAAGCCTTTGAAATCCGATTTTGCCAGATTTTCTGCCGATTATCAC 

TCACATTTGGACTCACTGAGCAAACCTVAGTGAATACAGTGACCTCAAGTGGGAGTAT^ 

AGAGCCTCGAGTAGCTCCCCTCTGGATTATTCATTCCAATTCACACCTTCTAGAACTGCA 

GGGACCAGCGGGTGCTCC7AGGAGAGCCTGGCGTACAGTGAAAGTGAATGGGGACCCAGC 

CTGGATGACTATGACAGGAGGCCAAAGTCTTCGTACCTGAATCAGACAAGCCCTCAGCCC 

ACCATGCGGCAGAGGTCCAGGTCAGGCTCGGGACTCCAGGAACCGATGATGCCATTTGGA 

GCAAGTGCATTTAAAACCCATCCCCAAGGACACTCCTACAACTCCTACACCTACCCTCGC 

TTGTCCGAGCCCACAATGTGCATTCCAAAGGTGGATTACGATCGAGCACAGATGGTCCT^ 

AGCCCTCCACTGTCAGGGTCTGACACCTACCCCAGGGGCCCTGCCAAACTACCTCAAAGT

CAAAGCAAATCGGGCTATTCCTCAAGCAGTCACCAGTACCCGTCTGGGTACCACAAAGCC 

ACCTTGTACCATCACCCCTCCCTGCAGAGCAGTTCGCAGTACATCTCCACGGCTTCCTAC 

CTGAGCTCCCTCAGCCTCTCATCCAGCACCTACCCGCCGCCCAGCTGGGGCTCCTCCTCC 

GACCAGCAGCCCTCCAGGGTGTCCCATGAACAGTTTCGGGCGGCCCTGCAGCTGGTGGTC 

AGCCCAGGAGACCCCAGGGAATACTTGGCCAACTTTATCAAAATCGGGGAAGGCTCAACC 

GGCATCGTATGCATCGCCACCGAGAAACACACAGGGAAACAAGTTGCAGTGAAGAAAATG 
GACCTCCGGAAGCAACAGAGACGAGAACTGCTTTTCAATGAGGTCGTGATCATGCGGGAT
TACCACCATGACAATGTGGTTGACATGTACAGCAGCTACCTTGTCGGCGATGAGCTCTGG 
GTGGTCATGGAGTTTCTAGAAGGTGGTGCCTTGACAGACATTGTGACTCACACCAGAATG 
AATGAAGAACAGATAGCTACTGTCTGCCTGTCAGTTCTGAGAGCTCTCTCCTACGTTCAT 
AACCAAGGAGTGATTCACAGGGACATAAAAAGTGACTCCATCCTCCTGACAAGCGATGGC 
CGGATAAAGTTGTCTGATTTTGGTTTCTGTGCTCAAGTTTCCAAAGAGGTGCCGAAGAGG 
AAATCATTGGTTGGCACTCCCTACTGGATGGCCCCTGAGGTGATTTCTAGGCTACCTTAT 
GGGACAGAGGTGGACATCTGGTCCCTCGGGATCATGGTGATAGAAATGATTGATGGCGAG 
CCCCCCTACTTCAATGAGCCTCCCCTCCAGGCGATGCGGAGGATCCGGGACAGTTTACCT 
CCAAGAGTGAAGGACCTACACAAGGTTTCTTCAGTGCTCCGGGGATTCCTAGACTTGATG 
TTGGTGAGGGAGCCCTCTCAGAGAGCAACAGCCCAGGAACTCCTCGGACATCCATTCTTA 
AAACTAGCAGGTCCACCGTCTTGCATCGTCCCCCTCATGAGACAATACAGGCATCACTGA 

SEQ ID NO: 116_SURTK106_H

ATGAATGATAGGAATGAGATTCAAATGGAAGCCAAACTCCAAAGTCTTACCATTATAGCA 
CAGGAAATTCTATGCAGATTCTTTATTACCCTTAGGAGACATGCACGTTTCCTGCTCACT 
AAACTAGGAAGGCAAGGAATGGCAAGGTCAGGAATTACTCACAGCTGTGCTGTGTGCATT 
CTCTGTGGGCCTAGCAGGGAAGGGGACAGCCCTGTGGCAATGGGCATGACACGGATGCTC 
CTGGAATGCAGTCTCAGTGACAAGTTGTGTGTCATCCAGGAGAAGCAGTATGAAGTGATT 
ATCGTCCCAACTTTGTTGGTTACTATCTTCCTCATCCTTCTTGGGGTCATCCTGTGGCTT 
TTTATCAGAGAACAAAGAACTCAACAGCAGCGTTCTGGACCTCAAGGCATTGCCCCTGTT 
CCTCCACCTAGGGACCTAAGCTGGGAAGCAGGACATGGAGGAAATGTGGCTTTGCCACTT 
AAGGAGACATCCGTGGAAAACTTTCTGGGAGCTACCACACCTGCCCTGGCTAAGCTGCAG
GTGCCGCGGGAGCAACTCTCTGAAGTTCTGGAGCAGATTTGCAGTGGTAGCTGTGGGCCC 
ATCTTTCGAGCCAATATGAACACTGGGGACCCTTCTAAGCCCAAGAGTGTTATTCTCAAG 
GCTTTAAAAGAACCAGCTGGGCTCCATGAGGTACAAGATTTCTTAGGGCGAATCCAATTC 
CATCAATACCTGGGGAAACACAAAAACCTGGTGCAGCTGGT^GGCTGCTGCACTGAAAAG 
CTGCCACTCTATATGGTGTTGGAGGATGTGGCCCAGGGGGACCTGCTCGGCTTTCTCTGG 
ACCTGTCGGCGGGATGTGATGACTATGGATGGTCTTCTCTATGATCTCACAGAAAAACAA 
GTATATCACATCGGT^GCAAGTCCTTTTGGCGCTGGAATTCCTGCAGGAGAAGCATTTG 
TTCCATGGGGATGTGGCAGCCAGGAATATTCTGATGCAAAGTGATCTCACTGCTAAGCTC 
TGTGGATTAGGCCTGGCTTATGAAGTTTACACCCGAGGGGCCATCTCCTCTACTCAAACC 
ATACCTCTCAAGTGGCTTGCCCCAGAACGGCTTCTCCTGAGACCTGCTAGCATCAGAGCA 
GATGTCTGGTCTTTTGGGATCCTGCTCTATGAGATGGTGACTCTAGGAGCACCACCGTAT 
CCTGAAGTCCCTCCTACCAGCATCCTAGAGCATCTCCAAAGAAGGAAAATCATGAAGAGA 
CCCAGTAGCTGCACACATACCATGTACAGTATCATGAAGTCCTGCTGGCGCTGGCGTGAG 
GCTGACCGCCCCTCACCTAGAGAGCTGCGCTTGCGCCTAGAAGCTGCCATTAAAACTGCA 
GATGACGAGGCTGTGTTACAAGTACCAGAGTTGGTGGTACCTGAACTGTATGCAGCTGTG 
GCCGGCATCAGAGTGGAGAGCCTCTTCTACAACTATAGCATGCTTTGAAGAGTCTCGGGC 
AAGAAACATTCATGCATGAGTATATGTTCTTGGAATCAATTCCTCTAAGAACAGAGAATG 
GTCTTTCCCAGGGACACAAAGGGAGAAATGGGACATGGATTCTTGATCTTCCTTTACACA 
TTTCTCGGGAAATCTGAAATGATGCTGGATGGGACTCTACACATCCTGAGCTAAGACATA 
CTGTCAGTCTCACTTCTGCTGTCCCAGTCCTAGAAATCCTGGGTAGAAGTGGTGGACCTG 
TGCAAAGGAGGTTTTAGAACTCTGCAGTATTTGTTGGGGCATGGCACAAATAAGCTCATC 
CCTCCCGTCCGAGGCTAGTTTCCTCTGGAACCACATTTTTATCTAGATGAAAATTTGGAA 
TGAAATGAAGGAATAGAAATCCAATAAAAGAGTTGAAGGGAAAGAAAATTTAAGGTTCTT 
CTTGCTCAGGATTACAGATATGGACCAACACCTCCTTCAAGAAAAGGTGGTAGGACACAA 
AGTTCTTCAGTCCTGAGCCCTACATGTGGGGCTGGAGGAGAACTATAACGGAAAAACCTC 
TGAGTTTCACCTTAGGTATAGATAAAAGAAAGATGGTCCCCTTTTATCTGATTCTGAGAC 
AGGTAAATTCTGTTTGTTACTACGTTTAATTAGAAGGTGGAGGAGTCATTTCATGATTAA 
GAACATTCAACATGTATTGTTCATTAAGCTAGCTTCCTAGTTCCGATTAGACTAAGGAGA
CTAAGCCTAGAGAGTCAATGTTAGAACAGTGAAAAGAATTCTGTGTGTGTGTGTGTGTGT 
GTGTGTGTGTGCACAATAAATAGGAAATGTAGAAACCAAGCAAGAAGGCTTAGTAGCTCA 
GTCTTTAACAAGGGCTAGAAAAGAATGTAATCTGATATGGAAGGATAGCAGCTTCTAATT 
TTCAATCATCTGTTGATATACTGTGAAACTTATTTTATTAAATTAATATTTATTAAATGG 

SEQ ID NO: 117_AA098024_M 

CTGCAGGAGAAGCT^CCTGTTTCATGGGGATGTGGCTGCCAGGAACATCCTGATCCAAAGT 

GACCTGACTCCCAAACTTTGTCATCTGGGCCTGGCTTATGAAGTTCATGCCCATGGGGCC 

ATCTCCTCTGCTCGATCCAGCACCATCCCTCTCAAGTGGCTTGCTCCAGAAAGGCTTCTC 

CTGAGACCTGCAAGCATCAGGGGAGATATTTGGTCCTTTGGGATCCTGCTTTATGAGATG 

GTGACTCTAGGAGCACCACCATACCCTGAAGTCCCrCCCACCAGCATCCTACT^ 

CAGAGAAAGAAAATCATGAAGAGACCCAGCAGCTGCTCACATGCCATGTACAACATC^ 

AAGTGCTGTTGGCGCTGGAGTGAGGACAGCCGCCCCTTACTTGTTCAGCTGCTCCAGCGC 

CTAGAAGCTGCTTCTAGATCTGCCGATGACAAGGCTGTGTTGCAAGTGCCAGAGTTGGTG 

GTGCCTGAACTGTATGCAGATGTGGCTGGCATCAGGGCAGAAAGCATTTCCTATAGCTTC 

AGTGTCCTTTGAAGATGGTCCTAGACAAATGACTATATATGGGTGGAATTAGTTCCTTCA 

AGAACAGAGAGAAGGAACTTTCTGTGGCCCACC^GGGAGAAAAAAGGACATGGATCTTG 

CATCTTTCCCTAAACATTTTCCTAGACATCTGAAATGCTGCTGGATGAAGCTCTACCTCT 

ACATACCATGTACTCTTGAGCTAAGAATCACCATCAATTGTAGTTTGCTTTCCAGTCCCA 

AGGGCTGAAGTATAAGTGGTGGACCGTGTCATTCTAAAGGAGGTTTTTAAAATCTGCAAT 

GATTGTAAGGGAATTAGGCAAAAGGGCTGGTCCCACTCACTCCAGGCTGGTTTACTACTG 

AAACTAGTTTTTCTTTTCTTTTTTTTTAAGTTAAACTATTACAGAGTAAAAATAAACCAG 

ATGGGCATGAATGAACACCTTCTAATTTTTAACCATGAATTGAATATTGGAATTCATGAG 

AAAGAAAATTCTAGGTTCTTTTTGCTAAGAGGTGTTAAGGTGAGTCAATATATCCTTCAA 

GGAAAGGCTTTGTCTCATCTATGTTGACGGGACGTAAAAGTCCTCGTCCCGTTATGAAGA 

GCTGAAGAAGATCTATAAGAAACAATACTGAGCCTTTCCTTGACTATAGATAGAAGAGCA 

TCCTTTCATTGAACTCTGAGGCAGGTGGACCATGCATGATACTAAGTTTAATTAGAAGCA 

GGAGGAGTCATTTCATGATTAGGAACTVTTGTTCATCCCT^TTTGTTTGCCAGTTCCTGTAA 

GACTAAGGAGAATCAGCCTATAGAGCCAAAGCTAGAACCAGGGATAAAAAGTGTGTGTGT 

GTATAACAAATAGGAAGCATGAAAGTCGAGCAAGAAGACTTAGTAACCCAGGTGGTCATT 

AAGAGGTACAGAGAAGAAGTAATCTTATAGGAATGGATGGTAGCTTCTAATTTTTAACCA 

TTCATTGAAATAACTGTGAAGCAACTCATTAAACTAGTATTTATTGACCAAAAGTAGACT 

TTTCAGGTGTATAGCTGCCAAAATCTCTATAATAAAGAGGCTAAAAGAAAATAT^TGGGA 

GTTATTTTACTAGGAAAATTAGAGAACCTATAGTTTCCAT^AAAGAGATTCTTTATGTGC^ 

AAATGAGATAACTCTCTACCTCACAGGGTTGGTGTGAGGAACAATGAGAATATGTATTTG 

TGTATTATGTAGAATATAATATATTCTCAATAAATACTAGTTTTTCCCCTTTC 

SEQ ID NO: 118_SGK2ALPHA_H 

GAAGAGGGCAGAGCCGTGCATGGGGCTGCTCCCCAGGACCTGAGCAGGAACCTGGAGTTT 
TCAGAGCTGCCTGATCATTGCTACAGAATGAACTCTAGCCCAGCTGGGACCCCAAGTCCA 
CAGCCCTCCAGGGCCAATGGGAACATCAACCTGGGGCCTTCAGCCAACCCAAATGCCCAG 
CCC^CGGACTTCGACTTCCTCAAAGTCATCGGCAAAGGGAACTACGGGAAGGTCCTACTG 
GCCTAGCGCAAGTCTGATGGGGCGTTCTATGCAGTGAAGGTACTACAGAAAAAGTCCATC 
TTAAAGAAGAAAGAGCAGAGCCACATCATGGCAGAGCGCAGTGTGCTTCTGAAGAACGTG 
CGGCACCCCTTCCTCGTGGGCCTGCGCTACTCCTTCCAGACACCTGAGAAGCTCTACTTC 
GTGCTCGACTATGTCAACGGGGGAGAGCTCTTCTTCCACCTGCAGCGGGAGCGCCGGTTC 
CTGGAGCCCCGGGCCAGGTTCTACGCTGCTGAGGTGGCCAGCGCCATTGGCTACCTGCAC 
TCCCTCAACATCATTTACAGGGATCTGAAACCAGAGAACATTCTCTTGGACTGCCAGGGA 
CACGTGGTGCTGACGGATTTTGGCCTCTGCAAGGAAGGTGTAGAGCCTGAAGACACCACA 
TCCACATTCTGTGGTACCCCTGAGTACTTGGCACCTGAAGTGCTTCGGAAAGAGCCTTAT
GATCGAGCAGTGGACTGGTGGTGCTTGGGGGCAGTCCTCTACGAGATGCTCCATGGCCTG 
CCGCCCTTCTACAGCCAAGATGTATCCCAGATGTATGAGAACATTCTGCACCAGCCGCTA 
CAGATCCCCGGAGGCCGGACAGTGGCCGCCTGTGACCTCCTGCAAAGCCTTCTCCACAAG 
GACCAGAGGCAGCGGCTGGGCTCCAAAGCAGACTTTCTTGAGATTAAGAACCATGTATTC 
TTCAGCCCCATAAACTGGGATGACCTGTACCACAAGAGGCTAACTCCACCCTTCAACCCA 
AATGTGACAGGACCTGCTGACTTGAAGCATTTTGACCCAGAGTTCACCCAGGAAGCTGTG 
TCCAAGTCCATTGGCTGTACCCCTGACACTGTGGCCAGCAGCTCTGGGGCCTCAAGTGCA 
TTCCTGGGATTTTCTTATGCGCCAGAGGATGATGACATCTTGGATTGCTAGAAGAGAAGG 
ACCTGTGAAACTACTGAGGCCAGCTGGTATTAGTAAGGAATTACCTTCAGCTGCTAGGAA 
GAGCGACTCT^AACTAACAATGGCTTCAACGAGAAGCAGGTTT^ 

AAGAAAAATAATGTTTCGGAGTCCAGGACTGGCAGGACAGGTCATCAGATACTCAGAGGC 

TGTATCTCTGCCCTGCCAACCTTGACAAATGGCTTCCAATGTTAGGTTTGCTACAAGATG 

GTTACTGGAGCTCTAGCTGCCTATTTTGTGTTTAGGGAAGGGAAAATGGAGGAAAGGGGA 

GAAGAGCAAAGGGCGCTTTTAAAGAGCTTTCCCAAAAGCTCCCCCCAATGACTTTTGCTT 

CCATCTCACTAACCACCCACCCCTACCTGGAATGGAGGCTGGGAAATGTGGCTTATTTGC 

TGGGTACGTGACTATCCCTAATAACAAAGGGGTTTTGACCCTAAGACATTAGGGGAGAAT 

GTTGGGTAGGCAGCCAGCCCTCTTTTACCATAGGGCCTCCTGGTGTTTGGATTTTGATCT 

CAATGTGTAAAATGACAGAGATGTAACAAGCTCATAGGGTATCAATATCTCTTATTG 

TATGTTGAAAAA 

SEQ ID NO: 120_CCRK_H

ATGGACCAGTACTGCATCCTGGGCCGCATCGGGGAGGGCGCCCACGGCATCGTCTTCAAG 

GCCAAGCACGTGGAGACTGGCGAGATAATTGCCCTCAAGAAGGTGGCCCTAAGGCGGTTG 

GAAGACGGCTTCCCTAACCAGGCCCTGCGGGAGATTAAGGCTCTGCAGGAGATGGAGGAC 

AATCAGTATGTGGTACAACTGAAGGCTGTGTTCCCACACGGTGGAGGCTTTGTGCTGGCC 

TTTGAGTTCATGCTGTCGGATCTGGCCGAGGTGGTGCGCCATGCCCAGAGGCCACTAGCC 

CAGGCACAGGTCAAGAGCTACCTGCAGATGCTGCTC7VAGGGTGTCGCCTTCTGCCATG 

AACAACATTGTACATCGGGACCTGAAACCTGCCAACCTGCTCATCAGCGCCTCAGGCCAG 

CTCAAGATAGCGGACTTTGGCCTGGCTCGAGTCTTTTCCCCAGACGGCAGCCGCCTCTAC 

ACACACCAGGTGGCCACCAGGTCTGTGGGCTGCATCATGGGGGAGCTGTTGAATGGGTCC 

CCCCTTTTCCCGGGCAAGAACGATATTGAACAGCTTTGCTATGTGCTTCGCATCTTGGGC 

ACCCCAAACCCTCAAGTCTGGCCGGAGCTCACTGAGCTGCCGGACTACAACAAGATCTCC 

TTTAAGGAGCAGGTGCCCATGCCCCTGGAGGAGGTGCTGCCTGACGTCTCTCCCCAGGCA 

TTGGATCTGCTGGGTCAATTCCTTCTCTACCCTCCTCACCAGCGCATCGCAGCTTCCAA 

GCTCTCCTCCATCAGTACTTCTTCACAGCTCCCCTGCCTGCCCATCCATCTGAGCTGCCG 

ATTCCTCAGCGTCTAGGGGGACCTGCCCCCAAGGCCCATCCAGGGCCCCCCCACATCCAT 

GACTTCCACGTGGACCGGCCTCTTGAGGGAGTCGCTGTTGAACCCAGAGCTGATTCGGCC 

CTTCATCCTGGAGGGGTGAGAAGTTGGCCCTGGTCCCGTCTGCCTGCTCCTCAGGACCAC 

TCAGTCCACCTGTTCCTCTGCCACCTGCCTGGCTTCACCCTCCAAGGCCTCCCCATGGCC 

ACAGTGGGCCCACACCACT^CCTTGCCCCTTAGCCCTTGCGAGGGTTGGTCTCGAGGC^ 

GGTCATGTTCCCAGCCAAGAGTATGAGAACATCCAGTCGAGCAGAGGAGATTCATGGCCT 

GTGCTCGGTGAGCCTTACCTTCTGTGTGCTACTGACGTACCCATCAGGACAGTG^ 

GCTGCCAGTCAAGGCCTGCATATGCAGAATGACGATGCCTGCCTTGGTGCTGCTTCCCCC 

GAGTGCTGCCTCCTGGTCAAGGAGAAGTGCAGAGAGTAA 

SEQ ID NO: 121_TESK2_H 

GAATTCGCGGCCGCTCGACGCTCAGCAGAGCTACCAGCTGCCCTGTTGGCTTCGCTGGTC 
GGATCGTCCTCCTGGCCCCGCCAAACAGGCGAGCGGCCCCGACTGTGGGGCATGGCAGTA 
GTCTCCTCGTTCTCCGCCGCCGCTAGCCTAGCTGAGTCGCCGGCTTCTGCGCTAGGGGCT 
CCCACCGCCTCCGCAGGCTAAGGAGCCGCTGCCACCAACGAGCTGTGAGGGTTACTATGC
TCCCTCTTTGCCGCCGTCTCCTCCTCTTGCCCGCGCAGGCACCCCTCTGGCTGCTCAGTC 
CTGCCTCAGTGTCAAACCAGAAGAGAAGTAAAATTCAACAAAAATTTATGTGTGGAGTTC 
CTTCTTAAAAGAAGAAAAAAGTGATTATTTAGACTATGGATCGGAGCAAACGGAATTCAA 
TTGCAGGATTTCCTCCACGTGTGGAGCGTCTTGAAGAGTTTGAAGGAGGTGGTGGAGGAG 
AAGGAAATGTGAGCCAGGTGGGAAGAGTTTGGCCATCTTCGTATCGAGCTCTTATAAGTG
CCTTTTCCAGACTGACGCGTTTGGATGATTTCACCTGTGAAAAAATAGGGTCTGGCTTCT 
TTTCTGAAGTGTTO^GGTACGACACCGAGCTTCTGGTCAGGTGATGGCTCTTAAGATGA 
ACACATTGAGCT^GTAACCraGGCAAACATGCTGAAAGAAGTACAGCTCATGAATAGACTCT 
CCCATCCCAACATCCTTAGGTATATCAACTCCGGGAACCTGGAACAGTTGCTAGACAGTA 
ACCTGCATTTGCCTTGGACTGTGAGGGTAAAACTGGCCTATGACATAGCAGTGGGCCTCA 
GCTACCTTCACTTCAAAGGCATTTTTCATCGGGACCTCACATCTAAGAACTGCCTGATAA 
AGAGGGATGAGAATGGTTACTCTGCAGTGGTAGCTGACTTTGGCCTGGCTGAGAAGATCC 
CCGATGTCAGCATGGGGAGTGAGAAGCTGGCCGTGGTGGGTTCCCCATTCTGGATGGCAC 
CTGAGGTTCTCCGAGATGAGCCCTATAATGAAAAGGCAGATGTGTTCTCTTATGGTATCA
TCCTCTGCGAGATCATCGCCCGCATCCAGGCCGATCCGGACTATCTTCCCCGCACAGAGA 
ATTTCGGGCTGGACTATGATGCTTTCCAGCACATGGTGGGAGACTGTCCCCCAGATTTTC 
TGCAACTTACTTTCAACTGCTGTAACATGGATCCCAAACTGCGCCCATCTTTTGTGGAGA 
TTGGGAAGACCCTGGAGGAAATTCTGAGCCGCCTACAGGAAGAAGAGCAGGAGAGGGATA 
GGAAGCTGCAGCCCACAGCCAGGGGACTCTTGGAGAAAGCACCTGGGGTGAAGCGACTAA 
GCTCACTGGATGACAAGATCCCCCACAAGTCACCATGCCCAAGACGTACCATCTGGCTGT 
CTCGAAGCCAGTCAGATATCTTTTCCCGTAAGCCCCCACGTACAGTGAGTGTCTTGGACC 
CATACTACCGGCCACGAGATGGTGCTGCCCGCACCCCCAAAGTCAACCCTTTTAGTGCTC 
GCCAGGACCTCATGGGGGGCAAGATCAAGTTTTTTGACCTGCCCAGCAAGTCTGTCATCT 
CTCTGGTATTTGACCTGGATGCACCAGGGCCCGGAACTATGCCCCTGGCTGACTGGCAGG 
AGCCCCTGGCCCCACCTATTCGCCGGTGGCGTTCCTTGCCTGGTTCGCCTGAGTTCTTGC 
ATCAAGAGGCTTGTCCATTTGTGGGCCGGGAAGAATCGCTATCTGATGGGCCCCCACCAC 
GCCTAAGTAGTCTCAAGTACAGAGTTAAAGAGATCCCACCATTCCGGGCATCTGCCCTAC 
CAGCTGCTCAAGCCCATGAGGCTATGGACTGCTCCATTCTCCAGGAAGAAAATGGTTTTG 
GGTCCAGGCCCCAGGGGACCAGTCCATGCCCTGCGGGTGCTTCTGAGGAGATGGAGGTAG 
AAGAAAGGCCAGCAGGCTCAACTCCAGCCACCTTCTCCACCTCAGGCATAGGCCTGCAAA 
CCCAGGGAAAGCAGGATGGGTGAGGGGGTTTAGTCCCTGCCTCACCTTGGGGATGGACCT 
TCAGCTGAAACCATATGGCCCCCTAGGTGCACAGCCTTGATTCTTCCCTGGAGCCTACAG 
AGCAGGCAGGCTAGGCCAAGCCAGGCTCAACTTCTGGGCTCCCAGTGCCCATTGGCTGTG 
TATGACGGGAGGCAGCAGTGAGAGGCCTTCCTAGTTAGGGCCAACAGCTGATACCAAGCC 
TCTGAAATCCAGCAAGGAGGTCTGCCTCCCACCAGACCCTCTCCAGTGTACTTCCCCAGA 
TAGGACCAGAGGATGTCTAGTTCTAGGCTGAGCTGGCAGGCAGCTATTACCCCGGTTCTT 
TCCCCACCCCAGGTCTGTCTCTTGCCTTTTCTTGGGGCATATAAGCTACTGAGTGGAACA 
TGGAGCTGATCAAGAGGCCGTAATGGTCATGGCTGTTTCCAGACCTGAATATTGGGTGCT 
TCTTGCCAGTATTCTAAGACATTTGAGTAATTGCTGTTTGCACTTACTGCATGGTCAGAC 
CACGTCACTACATTTCTATGCAAGGGGACAGCAAGGCAGCGTGGTGGTCATGGCTCTTAG 
CTAACCTATTCAAAGACCTTTTCCTGTTGATTAATCTATTTTCATATTTATAAAGGAGTC 
TTAATGTTCTGCCCCATAAGACTTTCAACCTTGTGGTTGGGAGTGGGGCTGGTTTTGTAG 
GCCCTAGGGCCTGCTTCTATGTATTTATCAACATGTGATACATTCAATTGGTTAAATGGT 
TTATACAGGGACTGATTTGCTTCCCTTCCTGCCATGGCTGGAGCTTTGGGAACAGTCTGT 
CCTTACAGAGCTGCAATAAGAAATAACCAAAGATGAAGCTGGTCAAATATTTTCATAACT 
TGCTTCTGTTGATTTTTTTTTTGTAAAACTTTCCCAAGACATTTTCAGACTTAAAAATAA 

AGTCAGTGTTACAGGT 